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OXALOACETATE HYDROLASE DEFICIENT FUNGAL HOST CELLS 

Background of the Invention 

Field of the Invention 

The invention relates to isolated nucleic acid sequences 
encoding polypeptides having oxaloacetate hydrolase activity. 
The present invention further relates to mutant host cells, in 
particular fungal mutant host cells such as cells of the genus 
Aspergillus, deficient in oxaloacetate hydrolase activity and 
thereby in oxalic acid production. The present invention also 
relates to the use of the mutant cells for producing desirable 
compounds, such as polypeptides, primary and secondary 
metabolites, and to methods of producing such compounds in the 
mutant cells of the invention. The invention also relates to 
nucleic acid constructs, vectors, and host cells comprising the 
nucleic acid sequences as well as recombinant methods for 
producing the polypeptides having oxaloacetate hydrolase 
activity. 

Description of the Related Art 

Filamentous fungi are widely used for the commercial 
production of a variety of compounds of interest, including 
homologous compounds, such as primary or secondary metabolites 
and polypeptides normally produced by the fungus . in question or 
heterologous compounds, such as heterologous polypeptides 
encoded by foreign DMA which has been introduced into the 
fungus in question. Such products are produced by fermentation 
of the fungus in question and harvest of the desired product 
resulting from the fermentation. 

The fungal species A. niger is widely used for the 
commercial production of desired compounds, eg citric acid and 
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industrial enzymes. It is well-known that this species produces 
large amounts of oxalic acid. For a number of reasons the 
production of oxalic acid is undesirably when this species is 
used for commercial production of a compound of interest. For 
instance, the production of oxalic acid requires a lot of 
carbon and thus extra, expensive carbon sources must be added 
to the fermentation medium compared to what would be required 
only for the production of the desired compound; the presence 
of oxalic acid in the fermentation broth causes problems in the 
downstream processing involved in the recovery of the product 
of interest because oxalic acid forms a precipitate with 
calcium which interferes in the recovery; and oxalic acid is a 
toxic compound which means that its presence is considered a 
major problem in the production of food grade products from A. 
niger . 

Two possible routes for the pathway for biosynthesis of 
oxalic acid in A. niger have been suggested. The first route is 
oxaloacetate + water -> Oxalic acid + acetate, the reaction 
being catalyzed by oxaloacetate hydrolase (Kubicek, C. P., G. 
Schreferl-Kunar, W. Wohrer and M. Rohr (1988) Appl . Environ. 
Microbiol. 54,633-637). The second route involves the 
glycoxylate pathway (Balmforth, A. J. , a. Thomson (1984): 
Biochem. J. 218 113-118) . 

It has been attempted to control the formation of oxalic 
acid during commercial A. niger fermentations by conducting the 
fermentation at a low pH where only little oxalic acid is 
formed. However, fermentation at low pH may be undesirable 
since normally this pH is not optimal for the growth of A. 
niger and yield of a desired fermentation product. 

A partially purified oxaloacetate hydrolase from A. niger 
has been described (Lenz et al . , Partial purification and some 
properties of oxaloacetate from Aspergillus niger, 1976, Eur. 
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J . Biochem. 65: 225-236), but the gene encoding this enzyme has 
not been described. 

It is an object of the. present invention to provide 
mutants of cells, such as filamentous fungal cells, in 
5 particular cells of A. niger, which is deficient in 
oxaloacetate hydrolase production and thereby oxalic acid 
production . 

Summary of the Invention 

10 

The present invention relates to isolated nucleic acid 
sequences encoding polypeptides having oxaloacetate hydrolase 
activity, selected from the group consisting of: 

(a) a nucleic acid sequence encoding a polypeptide having 
is an amino acid sequence which has at least 65% identity 

with the amino acid sequence of the oxaloacetate 
hydrolase of SEQ ID NO: 2; 

(b) a nucleic acid sequence having at least 65% homology 
with the coding part of the DNA sequence SEQ ID NO : 1 

20 (constituted by nucleotides 1157-1411, 1504-1651 and 

1764-2383 of SEQ ID NO : 1 ) 

(c) a nucleic acid sequence which hybridizes under medium 
stringency conditions with (i) the nucleic acid sequence of SEQ 
ID NO:l, (ii) the cDNA sequence of SEQ ID NO : 1 , (iii) a 

25 subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) 
a complementary strand of (i) , (ii) , or (iii) ; 

(d) an allelic variant of (a) , (b) , or (c) ; and 

(e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has 

30 oxaloacetate hydrolase activity. 

In a further important aspect the invention relates to a 
method for producing a mutant of a cell, which comprises 
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disrupting or deleting the nucleic acid sequence of the 
invention or *a control sequence thereof, which results in the 
mutant producing less of the oxaloacetate hydrolase and thus 
oxalic acid than the cell. The present invention also relates 
to a mutant produced by this method. 

The present invention also relates to nucleic acid 
constructs, vectors, and host cells comprising the nucleic acid 
sequences as well as recombinant methods for producing the 
polypeptides . 



Brief Description of the Figures 

Figure 1 shows the construction of plasmid pHP3 described 
in Example 4 . 

Detailed Description of the Invention 

Isolated Nucleic Acid Sequences Encoding Polypeptides Having 
oxaloacetate hydrolase Activity 

The term "oxaloacetate hydrolase activity" is defined 
herein as an activity which catalyzes the reaction: 
oxaloacetate + water -> Oxalic acid + acetate. The enzyme is 
classified as belonging to EC 3.7.1.1. For purposes of the 
present invention, oxaloacetate hydrolase activity is 
determined according to the procedure described in the 
Materials and Methods section further below. One unit of 
oxaloacetate hydrolase activity is defined as 1.0 ^mole of 
oxalic acid produced per minute at 30°C, pH 7.5. 

The term "isolated nucleic acid sequence" as used herein 
refers to a nucleic acid sequence which is essentially free of 
other nucleic acid sequences, e.g., at least about 20% pure, 
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preferably at least about 40% pure, more preferably at least 
about 60% pure, even more preferably at least about 80% pure, 
and most preferably at least about 90% pure as determined by 
agarose electrophoresis. For example, an isolated nucleic acid 
sequence can be obtained by standard cloning procedures used in 
genetic engineering to relocate the nucleic acid sequence from 
its natural location to a different site where it will be 
reproduced. The cloning procedures may involve excision and 
isolation of a desired nucleic acid fragment comprising the 
nucleic acid sequence encoding the polypeptide, insertion of 
the fragment into a vector molecule, and incorporation of the 
recombinant vector into a host cell where multiple copies or 
clones of the nucleic acid sequence will be replicated. The 
nucleic acid sequence may be of genomic,. cDNA, RNA, 
semisynthetic, synthetic origin, or any combinations thereof. 

In a first embodiment, the present invention relates to 
isolated nucleic acid sequences encoding polypeptides having an 
amino acid sequence which has a degree of identity to amino 
acids 1 to 341 of SEQ ID NO:2 (i.e., the mature polypeptide) of 
at least about 65%, preferably at least about 70%, more 
preferably at least about 80%, even more preferably at least 
about 90%, most preferably at least about 95%, and even most 
preferably at least about 97%, which have oxaloacetate 
hydrolase activity (hereinafter "homologous polypeptides") . In 
a preferred embodiment, the homologous polypeptides have an 
amino acid sequence which differs by five' amino acids, 
preferably by four amino acids, more preferably by three amino 
acids, even more preferably by two amino acids, and most 
preferably by one amino acid from amino acids 1 to 341 of SEQ 
ID M0:2. For purposes of the present invention, the degree of 
identity between two amino acid sequences is determined by the 
Clustal method (Higgins, 1989, CABIOS 5: 151-153) using the 
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LASERGENE™ MEGALIGN™ software (DNASTAR, Inc., Madison, WI ) with 
an identity table and the following multiple alignment 
parameters: Gap penalty of 10, and gap length penalty of 10. 
Pairwise alignment parameters were Ktuple=l, gap penalty=3 , 
windows=5, and diagonals=5 . 

Preferably, the nucleic acid sequences of the present 
invention encode polypeptides that comprise or consist of the 
amino acid sequence of SEQ ID NO : 2 or an allelic variant 
thereof; or a fragment thereof that has oxaloacetate hydrolase 
activity. 

The present invention also encompasses nucleic acid 
sequences which encode a polypeptide having the amino acid 
sequence of SEQ ID NO : 2 , which differ from SEQ ID NO : 1 by 
virtue of the degeneracy of the genetic code. The present 
invention also relates to subsequences of SEQ ID NO:l which 
encode fragments of SEQ ID NO: 2 which have oxaloacetate 
hydrolase activity or which are sufficiently long to be used 
for inactivation of an oxaloacetate hydrolase gene in a 
microbial cell (as described in the section entitled "Removal 
or Reduction of Oxaloacetate Hydrolase Activity"). 

A subsequence of SEQ ID NO:l is a nucleic acid sequence 
encompassed by SEQ ID NO : 1 except that one or more nucleotides 
from the 5 1 and/or 3' end have been deleted. Preferably, a 
subsequence contains at least 2800 nucleotides, more preferably 
at least 3000 nucleotides, and most preferably at least 3200 
nucleotides. A fragment of SEQ ID NO : 2 is a polypeptide having 
one or more amino acids deleted from the amino and/or carboxy 
terminus of this amino acid sequence. Preferably, a fragment 
contains at least 2 70 amino acid residues, more preferably at 
least 300 amino acid residues, and most preferably at least 320 
amino acid residues. 

An allelic variant denotes any of two or more alternative 
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forms of a gene occupying the same chomosomal locus. Allelic 
variation arises naturally through mutation, and may result in 
polymorphism within populations. Gene mutations can be silent 
(no change in the encoded polypeptide) or may encode 
polypeptides having altered amino acid sequences. The allelic 
variant of a polypeptide is a polypeptide encoded by an allelic 
variant of a gene . 

The amino acid sequences of the homologous polypeptides 
may differ from the amino acid sequence of SEQ ID NO : 2 by an 
insertion or deletion of one or more amino acid residues and/or 
the substitution of one or more amino acid residues by 
different amino acid residues. Preferably, amino acid changes 
are of a minor nature, that is conservative amino acid 
substitutions that do not significantly affect the folding 
and/or activity of the protein; small deletions, typically of 
one to about 3 0 amino acids; small amino- or carboxyl -terminal 
extensions, such as an amino- terminal methionine residue; a 
small linker peptide of up to about 20-25 residues; or a small 
extension that facilitates purification by changing net charge 
or another function, such as a poly-hist idine tract, an 
antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the 
group of basic amino acids (such as arginine, lysine and 
hist idine) , acidic amino acids (such as glutamic acid and 
aspartic acid) , polar amino acids (such as glutamine and 
asparagine) , hydrophobic amino acids (such as leucine, 
i so leucine and valine) , aromatic amino acids (such as 
phenylalanine, tryptophan and tyrosine) , and small amino acids 
(such as glycine, alanine, serine, threonine and methionine) . 
Amino acid substitutions which do not generally alter the 
specific activity are known in the art and are described, for 
example, by H. Neurath and R.L. Hill, 1979, In, The Proteins, 
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Academic Press, New York. The most commonly occurring 

exchanges are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, 
Ala/Thr, Ser/Asn, Ala/Val , Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, 
Asp/Asn, Leu/Ile, Leu/Val , Ala/Glu, and Asp/Gly as well as 
these in reverse . 

In a second embodiment, the present invention relates to 
isolated nucleic acid sequences which have a degree of homology" 
to the mature polypeptide coding sequence of SEQ ID NO:l 
(constituted by nucleotides 1157-1411, 1504-1651 and 1764-2383 
of SEQ ID NO:l) of at least about 65%, preferably about 70%, 
preferably about 8 0%, more preferably about 90%, even more 
preferably about 95%, and most preferably about 97% homology, 
which encode an active polypeptide or which are sufficiently 
long to be used for inactivation of an oxaloacetate hydrolase 
gene in a microbial cell (as described in the section entitled 
"Removal or Reduction of Oxalolacetate Hydrolase Activity"); or 
allelic variants and subsequences of SEQ ID NO:l which encode 
polypeptide fragments which have oxaloacetate hydrolase 
activity or which are sufficiently long to be used for 
inactivation of an oxaloacetate hydrolase gene in a microbial 
cell (as described in the section entitled "Removal or 
Reduction of Oxaloacetate Hydrolase Activity"). For purposes 
of the present invention, the degree of homology between two 
nucleic acid sequences is determined by the computer program 
Align provided in the Fasta program package (Version v20u6) , 
using GAP with the following settings for nucleotide sequence 
comparison: GAP creation penalty (for the first residue) in a 
GAP of -16 and GAP extension (for the additional residues) 
penalty of -4. Align is a slightly modified version of a 
program taken from E. Myers and W. Miller. The algorithm is 
described in E. Myers and W. Miller, "Optimal Alignments in 
Linear Space" (CABIOS (1988) 4:11-17). 
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In a third embodiment, the present invention relates to 
isolated nucleic acid sequences encoding polypeptides having 
oxaloacetate hydrolase activity which hybridize under medium 
stringency conditions, more preferably medium-high stringency 
5 conditions, even more preferably high stringency conditions, 
and most preferably very high stringency conditions with a 
nucleic acid probe which hybridizes under the same conditions 
with (i) the nucleic acid sequence of SEQ ID NO : 1 , (ii) the 
cDNA sequence of SEQ ID NO : 1 , (iii) a subsequence of (i) or 

10 (ii) , or (iv) a complementary strand of (i) , (ii) , or (iii) (J. 
Sambrook, E.F. Fritsch, and T. Maniatus, 198 9, Molecular 
Cloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, 
New York) . The subsequence of SEQ ID NO : 1 may be at least 100 
nucleotides or preferably at least 200 nucleotides.. Moreover, 

is the subsequence may encode a polypeptide fragment which has 
oxaloacetate hydrolase activity. 

The nucleic acid sequence of SEQ ID NO: 1 or a subsequence 
thereof, as well as the amino acid sequence of SEQ ID NO: 2 or a 
fragment thereof, may be used to design a nucleic acid probe to 

20 identify and clone DNA encoding polypeptides having 
oxaloacetate hydrolase activity from strains of different 
genera or species according to methods well known in the art. 
In particular, such probes can be used for hybridization with 
the genomic or cDNA of the genus or species of interest, 

25 following standard Southern blotting procedures, in order to 
identify and isolate the corresponding gene therein. Such 
probes can be considerably shorter than the entire "sequence, 
but should be at least 15, preferably at least 25, and more 
preferably at least 3 5 nucleotides in length. Longer probes 

30 can also be used. Both DNA and RNA probes can be used. The 
probes are typically labeled for detecting the corresponding 
gene (for example, with 32 P, 3 H, 35 S, biotin, or avidin) . Such 
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probes are encompassed by the present invention. 

Thus, a genomic DNA or cDNA library prepared from such 
other organisms may be screened for DNA which hybridizes with 
the probes described above and which encodes a polypeptide 
having oxaloacetate hydrolase activity. Genomic or other DNA 
from such other organisms may be separated by agarose or 
polyacrylamide gel electrophoresis, or other separation 
techniques. DNA from the libraries or the separated DNA may be 
transferred to and immobilized on nitrocellulose or other 
suitable carrier material. In order to identify a clone or DNA 
which is homologous with SEQ ID NO : 1 or a subsequence thereof, 
the carrier material is used in a Southern blot. For purposes 
of the present invention, hybridization indicates that the 
nucleic acid sequence hybridizes to a nucleic acid probe 
corresponding to the nucleic acid sequence shown in SEQ ID 
NO:l, its complementary strand, or a subsequence thereof, under 
very low to very high stringency conditions. Molecules to 
which the nucleic acid probe hybridizes under these conditions 
are detected using X-ray film. 

In a preferred embodiment, the nucleic acid probe is a 
nucleic acid sequence which encodes the polypeptide of SEQ ID 
NO: 2, or a subsequence thereof. In another preferred 

embodiment, the nucleic acid probe is SEQ ID NO:l. In another 
preferred embodiment, the nucleic acid probe is the nucleotides 
encoding the fragment of SEQ ID NO: 2 constituted by amino acid 
residues 123-205 or a part of this fragment. In another 
preferred embodiment, the nucleic acid probe is the nucleic 
acid sequence contained in plasmid pHPl which is contained in 
Escherichia coli DSM-12660 wherein the nucleic acid sequence 
encodes a polypeptide having oxaloacetate hydrolase activity, 
in particular the mature polypeptide coding region of SEQ ID 
NO: 1 . 
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For long probes of at least 100 nucleotides in length, 
very low to very high stringency conditions are defined as 
prehybridization and hybridization at 42 °C in 5X SSPE, 0.3% 
SDS, 2 00 i-ig/ml sheared and denatured salmon sperm DNA, and 
5 either 25% formamide for very low and low stringencies, 35% 
formamide for medium and medium-high stringencies, or 50% 
formamide for high and very high stringencies, following 
standard Southern blotting procedures. 

For long probes of at least 100 nucleotides in length, the 

10 carrier material is finally washed three times each for 15 
minutes using 2 x SSC, 0.2% SDS preferably at least at 45°C 
(very low stringency) , more preferably at least at 50°C (low 
stringency) , more preferably at least at 55 °C (medium 
stringency) , more preferably at least at 60°C (medium-high 

15 stringency) , even more preferably at least at 65 °C (high 
stringency) , and most preferably at least at 70°C (very high 
stringency) . 

For short probes which are about 15 nucleotides to about 
70 nucleotides in length, stringency conditions are defined as 
20 prehybridization, hybridization, and washing post -hybridization 
at 5°C to 10°C below the calculated T m using the calculation 
according to Bolton and McCarthy (1962, Proceedings of the 
National Academy of Sciences USA 48:1390) in 0.9 M NaCl , 0.09 M 
. Tris-HCl pH 7.6, 6 mM EDTA, 0.5% NP-40, IX Denhardt ' s solution, 
25 1 mM sodium pyrophosphate, 1 mM sodium monobasic phosphate, 0.1 
mM ATP, and 0.2 mg of yeast RNA per ml following standard 
Southern blotting procedures. 

For short probes which are about 15 nucleotides to about 
70 nucleotides in length, the carrier material is washed once 
30 in 6X SCC plus 0.1% SDS for 15 minutes and twice each for 15 
minutes using 6X SSC at 5°C to 10°C below the calculated T m . 

The present invention also relates to isolated nucleic 
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acid sequences produced by (a) hybridizing a DNA under medium, 
medium-high, high, or very high stringency conditions with the 
sequence of SEQ ID NO : 1 , or its complementary strand, or a 
subsequence thereof; and (b) isolating the nucleic acid 
sequence. The subsequence is preferably a sequence of at least 
100 nucleotides such as a sequence which encodes a polypeptide 
fragment which has oxaloacetate hydrolase activity or a 
sequence which is sufficiently long to be used for inactivation 
of an oxaloacetate hydrolase gene in a microbial cell (as 
described in the section entitled "Removal or Reduction of 
Oxaloacetate Hydrolase Activity" ) . . 

The polypeptides encoded by the isolated nucleic acid 
sequences of the present invention have at least 2 0%, 
preferably at least 40%, more preferably at least 60%, even 
more preferably at least 80%, even more preferably at least 
90%, and most preferably at least 100% of the oxaloacetate 
hydrolase activity of the mature polypeptide of SEQ ID NO:2. 

The nucleic acid sequences of the present invention may be 
obtained from microorganisms of any genus. For purposes of the 
present invention, the term "obtained from" as used herein in 
connection with a given source shall mean that the polypeptide 
encoded by the nucleic acid sequence is produced by the source 
or by a cell in which the nucleic acid sequence from the source 
has been inserted. 

The nucleic acid sequences may be obtained from a 
bacterial source. For example, these polypeptides may be 
obtained from a gram positive bacterium such as a Bacillus 
strain, e.g., Bacillus alkalophilus , Bacillus 

amyloliquefaciens, Bacillus brevis, Bacillus circulans, 
Bacillus coagulans, Bacillus lautus, Bacillus lentus, Bacillus 
lichenif ormis, Bacillus megaterium, Bacillus 

stearothermophilus, Bacillus subtilis, or Bacillus 
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thuringiensis; or a Streptomyces strain, e.g., Streptomyces 
lividans or Streptomyces murinus; or from a gram negative 
bacterium, e.g., E. coli or Pseudomonas sp . 

In a preferred embodiment, the nucleic acid sequence is 
obtained from a fungal cell. "Fungi" as used herein includes 
the phyla Ascomycota, Basidiomycota, Chytridiomycota , and 
Zygomycota (as defined by Hawksworth et al . , In, Ainsworth and 
Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB 
International, University Press, Cambridge, UK) as well as the 
Oomycota (as cited in Hawksworth et al . , 1995, supra, page 171) 
and all mitosporic fungi (Hawksworth et al . , 1995, supra). 

In a more preferred embodiment, the fungal cell is a yeast 
cell . "Yeast" as used herein includes ascosporogenous yeast 
(Endomycetales) , basidiosporogenous yeast, and yeast belonging 
to the Fungi Imperfecti (Blastomycetes ) . Since the 

classification of yeast may change in the future, for the 
purposes of this invention, yeast shall be defined as described 
in Biology and Activities of Yeast (Skinner, F.A., Passmore, 
S.M., and Davenport, R.R., eds, Soc . App. Bacterid. Symposium 
Series No. 9, 1980) . 

More specifically, the nucleic acid sequences may be 
obtained from a yeast strain such as a Candida, Hansenula, 
Kluyveromyces , Pichia, Saccharomyces , Schizosaccharomyces , or 
Yarrowia strain; or more preferably from a filamentous fungal 
strain such as an Acremonium, Aspergillus, Aureobasidium, 
Cryptococcus , Filibasidium, Fusarium, Humicola, Magnaporthe, 
Mucor, Myceliophthora, Neocallimastix, Neurospora, 

Paecilomyces , Penici Ilium, Piromyces, Schizophyllum, 

Talaromyces, Thermoascus, Thielavia, Tolypocladium, or 
Trichoderma strain . 

In a preferred embodiment, the nucleic acid sequences are 
obtained from a Saccharomyces carlsbergensis , Saccharomyces 
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cerevisiae, Saccharomyces diastaticus , Saccharomyces douglasii, 
Saccharomyces kluyveri, Saccharomyces norbensis or 

Saccharomyces oviformis strain. 

In another preferred embodiment, the nucleic acid 
sequences are obtained from an Aspergillus aculeatus,-" 
Aspergillus awamori , Aspergillus foetidus, Aspergillus 
japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus" 
oryzae, Fusarium bactridioides , Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, 
Fusarium graminum, Fusarium heterosporum, Fusarium negundi , 
Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, 
Fusarium sambucinum, Fusarium sarcochroum, Fusarium 

sporotrichioides, Fusarium sulphureum, Fusarium torulosum, 
Fusarium trichothecioides , Fusarium venenatum, Humicola 
insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora 
thermophila, Neurospora crassa, Penicillium purpurogenum, 
Trichoderma harzianum, Trichoderma koningii, Trichoderma 
longibrachiatum, Trichoderma reesei, or Trichoderma viride 
strain. 

In a more preferred embodiment, the nucleic acid sequences 
are obtained from Aspergillus niger, and most preferably from 
A. niger BOl DSM 12665, e.g., the nucleic acid sequence set 
forth in SEQ ID NO : 1 . In another more preferred embodiment, 
the nucleic acid sequence is the sequence contained in plasmid 
pHPl which is contained in Escherichia coli DSM- 12660 . In 
another preferred embodiment, the nucleic acid sequence is a 
nucleotide sequence constituted by nucleotides 1157-1411, 1504- 
1651 and 1764-2383 of SEQ ID NO : 1 , which encodes the mature 
polypeptide of SEQ ID NO:2. 

It will be understood that for the aforementioned species, 
the invention encompasses both the perfect and imperfect 
states, and other taxonomic equivalents, e.g., anamorphs, 
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regardless of the species name by which they are known. Those 
skilled in the art will readily recognize the identity of 
appropriate equivalents. For example, the polypeptides may be 
obtained from microorganisms which are taxonomic equivalents of 
5 Aspergillus niger as defined by Benn and Klich (Benn J. W and 
M. A. . Klich (1992) Aspergillus, Biology and industrial 
applications. Butterworth - Heinemann, USA) , regardless of the 
species name by which they are known, e.g. A. aculeatus, A. 
awamori , A. carboniarus, A. ellipticus, A. ficuum, A. foetidus, 

10 A. heteromorphus , A. japonicus, A. phoenicis, A. pulverulentus , 
A. tubingensis, A. helicothrix, A. atroviolaceus , A. citricus, 
A. acidicus or A. f onsecaeus . 

Strains of these species are readily accessible to the 
public in a number of culture collections, such as the American 

15 Type Culture Collection (ATCC) , Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH (DSM) , Centraalbureau 
Voor Schimmelcultures (CBS) , and Agricultural Research Service 
Patent Culture Collection, Northern Regional Research Center 
(NRRL) . 

20 Furthermore, such nucleic acid sequences may be identified 

and obtained from other sources including microorganisms 
isolated from nature (e.g., soil, composts, water, etc.) using 
the above-mentioned probes. Techniques for isolating 

microorganisms from natural habitats are well known in the art. 

25 The nucleic acid sequence may then be derived by similarly 
screening a genomic or cDNA library of another microorganism. 
Once a nucleic acid sequence encoding a polypeptide has been 
detected with the probe (s), the sequence may be isolated or 
cloned by utilizing techniques which are known to those of 

30 ordinary skill in the art (see, e.g., Sambrook et al . , 1989, 
supra) . 

The techniques used to isolate or clone a nucleic acid 
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sequence encoding a polypeptide are known in the art and 
include isolation from genomic DNA, preparation from cDNA, or a 
combination thereof. The cloning of the nucleic acid sequences 
of the present invention from such genomic DNA can be effected, 
e.g., by using the well known polymerase chain reaction (PCR) 
or antibody screening of expression libraries to detect cloned 
DNA fragments with shared structural features. See, e.g., 
Innis et al . , 1990, PCR: A Guide to Methods and Application, 
Academic Press, New York. For PCR it may be of particular 
relevance to use a set of primers spanning the nucleotide 
sequence encoding amino acids 123-205 of SEQ ID NO : 2 . Other 
nucleic acid amplification procedures such as ligase chain 
reaction (LCR) , ligated activated transcription (LAT) and 
nucleic acid sequence -based amplification (NASBA) may be used. 

The nucleic acid sequence may be cloned from a strain of 
Aspergillus, or another or related organism and thus, for 
example, may be an allelic or species variant of the 
polypeptide encoding region of the nucleic acid sequence. 

Modification of a nucleic acid sequence of the present 
invention may be necessary for the synthesis of polypeptides 
substantially similar to the polypeptide. The term 

"substantially similar" to the polypeptide refers to non- 
naturally occurring forms of the polypeptide. These 
polypeptides may differ in some engineered way from the 
polypeptide isolated from its native source, e.g., variants 
that differ in specific activity, thermostability, pH optimum, 
or the like. The variant sequence may be constructed on the 
basis of the nucleic acid sequence presented as the polypeptide 
encoding part of SEQ ID NO:l, e.g., a subsequence thereof, 
and/or by introduction of nucleotide substitutions which do not 
give rise to another amino acid sequence of the polypeptide 
encoded by the nucleic acid sequence, but which corresponds to 
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the codon usage of the host organism intended for production of 
the enzyme, or by introduction of nucleotide substitutions 
which may give rise to a different amino acid sequence. For a 
general description of nucleotide substitution, see, e.g., Ford 
et al . , 1991, Protein Expression and Purification 2: 95-107. 

It will be apparent to those skilled in the art that such 
substitutions can be made outside the regions critical to the 
function of the molecule and still result in an active 
polypeptide. Amino acid residues essential to the activity of 
the polypeptide encoded by the isolated nucleic acid sequence 
of the invention, and therefore preferably not subject to 
substitution, may be identified according to procedures known 
in the art, such as site-directed mutagenesis or alanine- 
scanning mutagenesis (see, e.g., Cunningham and Wells, 1989, 
Science 244: 1081-1085). In the latter technique, mutations 
are introduced at every positively charged residue in the 
molecule, and the resultant mutant molecules are tested for 
oxaloacetate hydrolase activity to identify amino acid residues 
that are critical to the activity of the molecule. Sites of 
substrate -enzyme interaction can also be determined by analysis 
of the three-dimensional structure as determined by such 
techniques as nuclear magnetic resonance analysis, 
crystallography or photoaf f inity labelling (see, e.g., de Vos 
et al. , 1992, Science 255: 306-312; Smith et al . , 1992, Journal 
of Molecular Biology 224: 899-904; Wlodaver et al . , 1992, FEBS 
Letters 309: 59-64). 

Methods for Producing Mutant Nucleic Acid Sequences 

The present invention further relates to methods for 
producing a mutant nucleic acid sequence, comprising 
introducing at least one mutation into the mature polypeptide 
coding sequence of SEQ ID NO : 1 or a subsequence thereof, 
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wherein the mutant nucleic acid sequence encodes a polypeptide 
which consists of amino acids 1 to 341 of SEQ ID NO: 2 or a 

fragment thereof which has oxaloacetate hydrolase activity. 

The introduction of a mutation into the nucleic acid 
sequence to exchange one nucleotide for another nucleotide may-' 
be accomplished by site-directed mutagenesis using any of the 
methods known in the art. Particularly useful is the procedure 
which utilizes a supercoiled, double stranded DNA vector with 
an insert of interest and two synthetic primers containing the 
desired mutation. The oligonucleotide primers, each 

complementary to opposite strands of the vector, extend during 
temperature cycling by means of Pfu DNA polymerase. On 
incorporation of the primers, a mutated plasmid containing 
staggered nicks is generated. Following temperature cycling, 
the product is treated with Dpnl which is specific for 
methylated and hemimethylated DNA to digest the parental DNA 
template and to select for mutation- containing synthesized DNA. 

Other procedures known in the art may also be used. 

Removal or Reduction of Oxaloacetate Hydrolase Activity 

The present invention also relates to methods for 
producing a mutant cell of a parent cell, which comprises 
disrupting or deleting a nucleic acid sequence of the present 
invention or a control sequence thereof, which results in the 
mutant cell producing less of the polypeptide having 
oxaloacetate hydrolase activity encoded by the nucleic acid 
sequence than the parent cell and thus less oxalic acid when 
cultivated under the same conditions. 

The construction of strains that have reduced oxaloacetate 
hydrolase activity and oxalic acid production may be 
conveniently accomplished by modification or inactivation of a 
nucleic acid sequence necessary for expression of the 
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polypeptide having oxaloacetate hydrolase activity in the cell. 

The nucleic acid sequence to be modified or inactivated may 
be, for example, a nucleic acid sequence encoding. the 
polypeptide or a part thereof essential for exhibiting 
oxaloacetate hydrolase activity, eg a nucleic acid sequence of 
the invention as described in the section entitled "Isolated 
Nucleic Acid Sequences Encoding Polypeptides Having Oxaloactate 
Hydrolase Activity" , or the nucleic acid sequence may have a 
regulatory function required for the expression of the 
polypeptide from the coding sequence of the nucleic acid 
sequence. An example of such a regulatory or control sequence 
may be a promoter sequence or a functional part thereof, i.e., 
a part which is sufficient for affecting expression of the 
polypeptide. Other control sequences for possible modification 
are described further in this section. 

Modification or inactivation of the nucleic acid sequence 
may be performed by subjecting the cell to mutagenesis and 
selecting or screening for cells in which the oxaloacetate 
hydrolase producing capability has been reduced. The 
mutagenesis, which may be specific or random, may be performed, 
for example, by use of a suitable physical or chemical 
mutagenizing agent, by use of a suitable oligonucleotide, or by 
subjecting the DNA sequence to PCR generated mutagenesis. 
Furthermore, the mutagenesis may be performed by use of any 
combination of these mutagenizing agents. 

Examples of a physical or chemical mutagenizing agent 
suitable for the present purpose include ultraviolet (UV) 
irradiation, hydroxyl amine , N-methyl-N' -nitro-N- 

nitrosoguanidine (MNNG) , 0-methyl hydroxyl amine , nitrous acid, 
ethyl methane sulphonate (EMS) , sodium bisulphite, formic acid, 
and nucleotide analogues. 

When such agents are used, the mutagenesis is typically 
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performed by incubating the cell to be mutagenized in the 
presence of the mutagenizing agent of choice under suitable 
conditions, and selecting for cells exhibiting reduced 
oxaloacetate hydrolase activity or production. The reduced or 
eliminated oxaloacetate hydrolase activity may be determined by 
use of the assay described in the Examples section hereinafter. 

Modification or inactivation of production of a 
polypeptide encoded by a nucleic acid sequence of the present 
invention may be accomplished by introduction, substitution, or 
removal of one or more nucleotides in the nucleic acid sequence 
encoding the polypeptide or a regulatory element required for 
the transcription or translation thereof. For example, 

nucleotides may be inserted or removed so as to result in the 
introduction of a stop codon, the removal of the start codon, 
or a change of the open reading frame. Such modification or 
inactivation may be accomplished by site-directed mutagenesis 
or PCR generated mutagenesis in accordance with methods known 
in the art. Although, in principle, the modification may be 
performed in vivo, i.e., directly on the cell expressing the 
nucleic acid sequence to be modified, it is preferred that the 
modification be performed in vitro as exemplified below. 

An example of a convenient way to eliminate' • or reduce 
production by a host cell of choice is based on techniques of 
gene replacement or gene interruption. For example, in the 
gene interruption method, a nucleic acid sequence corresponding 
to the endogenous gene or gene fragment of interest is 
mutagenized in vitro to produce a defective nucleic acid 
sequence which is then transformed into the host cell to 
produce a defective gene. By homologous recombination, the 
defective nucleic acid sequence replaces the endogenous gene or 
gene fragment. It may be desirable that the defective gene or 
gene fragment also encodes a marker which may be used for 
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selection of transformants in which the gene encoding the 
polypeptide has been modified or destroyed. 

Alternatively, modification or inactivation of the nucleic 
acid sequence may be performed by established anti-sense 
5 techniques using a nucleotide sequence complementary to the 
polypeptide encoding sequence. More specifically, production 
of the polypeptide by a cell may be reduced or eliminated by 
introducing a nucleotide sequence complementary to the nucleic 
acid sequence encoding the polypeptide which may be transcribed 

10 in the cell and is capable of hybridizing to the polypeptide 
mRNA produced in the cell. Under conditions allowing the 
complementary ant i -sense nucleotide sequence to hybridize to 
the polypeptide mRNA, the amount of polypeptide translated is 
thus reduced or eliminated. 

15 It is preferred that the cell to be modified in accordance 

with the methods of the present invention is of microbial 
origin, in particular of any of the species listed in the 
section above entitled "Isolated Nucleic Acid Sequences 
Encoding Polypeptides Having Oxaloactate Hydrolase Activity" as 

20 sources for the nucleic acid sequence of the invention encoding 
a polypeptide with oxaloacetate hydrolase activity. Preferably, 
the cell is a fungal strain which is suitable for the 
production of desired products such as polypeptides or primary 
or secondary metabolites, either homologous or heterologous to 

25 the cell. Even more preferably, the cell is a cell of 
Aspergillus, in particular A. niger. 

The present invention further relates to a mutant cell of 
a parent cell which comprises a disruption or deletion of a 
nucleic acid sequence encoding the polypeptide or a control 

30 sequence thereof, which results in the mutant cell producing 
less of the polypeptide than the parent cell. 

The polypeptide-def icient mutant cells so created are 
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particularly useful as host cells for the expression of 
homologous and/or heterologous expression products, such as 
homologous or heterologous polypeptides or primary or secondary 
metabolites. Therefore, the present invention further relates 
to methods for producing a homologous or heterologous product" 
comprising (a) cultivating the mutant cell under conditions, 
conducive for production of the product; and (b) recovering the 
product . 

The methods used for cultivation and purification of the 
product of interest may be performed by methods known in the 
art, eg as described further below in the section entitled 
"Methods of Production" . 

The methods of the present invention for producing an 
essentially oxalic acid free product is of particular interest 
in the production of primary metabolites, in particular food 
grade products such as citric acid and other Krebs cyclus 
acids, and homologous or heterologous polypeptides, in 
Particular eukaryotic polypeptides. 

The polypeptide may be any polypeptide heterologous or 
homologous to the mutant cell. The term "polypeptide" is not 
meant herein to refer to a specific length of the encoded 
product, and therefore, encompasses peptides, oligopeptides, 
and proteins. The term "heterologous polypeptide" is defined 
herein as a polypeptide that is not native to the mutant cell, 
a native (or protein in which modifications have been made to 
alter the native sequence, or a native protein whose expression 
is quantitatively altered as a result of a manipulation of the 
mutant cell by recombinant DNA techniques. The mutant cell may 
contain one or more copies of the nucleic acid sequence 
encoding the homologous or heterologous polypeptide. In a 
preferred embodiment, the heterologous polypeptide is an 
extracellularly secreted polypeptide. 
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Preferably, the polypeptide to be produced is a 
hormone, hormone variant, enzyme, receptor or portion thereof, 
antibody or portion thereof, or reporter. The enzyme may be 
selected from, e.g., an amylolytic enzyme, lipolytic enzyme, 
proteolytic enzyme, cellulytic enzyme, oxidoreductase , or plant 
cell -wall degrading enzyme. Examples of such enzymes include 
an aminopeptidase , amylase, amyloglucosidase , carbohydrase , 
carboxypeptidase, catalase, cellulase, chitinase, cutinase, 
cyclodextrin glycosyltransf erase , deoxyribonuclease , esterase , 
galactosidase , beta-galactosidase , glucoamylase , glucose 
oxidase, glucosidase, haloperoxidase , hemicellulase , invertase, 
isomerase, laccase, ligase, lipase, lyase, mannosidase, 
oxidase, pectinolytic enzyme, peroxidase, phytase, 

phenoloxidase , polyphenoloxidase , proteolytic enzyme, 

ribonuclease , transferase, transglutaminase, or xylanase. The 
oxaloacetate hydrolase-def icient cells may also be used to 
express heterologous proteins of pharmaceutical interest such 
as hormones, growth factors, receptors, and the like. 

The nucleic acid sequence encoding a heterologous 
polypeptide that can be expressed in a mutant cell' of the 
invention, in particular a filamentous fungal mutant cell, may 
be obtained from any prokaryotic, eukaryotic, or other source. 

For purposes of the present invention, the term u obtained 
from' 7 as used herein in connection with a given source shall 
mean that the polypeptide is produced by the source or by a 
cell in which a gene from the source has been inserted. 

In the methods of the present invention, the mutant cell, 
in particular a mutant filamentous fungal cell may also be used 
for the recombinant production of polypeptides or other 
products such as primary or secondary metabolites that are 
native (or "homologous") to the cell. 

The techniques used to isolate or clone a nucleic acid 
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sequence encoding a heterologous polypeptide are known in the 
art and include isolation from genomic DNA, preparation from 
cDNA, or a combination thereof. The cloning of the nucleic 
acid sequence from such genomic DMA can be effected, e.g., by 
using the well known polymerase chain reaction (PCR) . See, for - " 
example, Innis et al . , 1990, PCR Protocols: A Guide to Methods, 
and Application, Academic Press, New York. The cloning 
procedures may involve excision and isolation of a desired 
nucleic acid fragment comprising the nucleic acid sequence 
encoding the polypeptide, insertion of the fragment into a 
vector molecule, and incorporation of the recombinant vector 
into the mutant cell where multiple copies or clones of the 
nucleic acid sequence will be replicated. The nucleic acid 
sequence may be of genomic, cDNA, RNA, semisynthetic, synthetic 
origin, or any combinations thereof. 

In the methods of the present invention, heterologous 
polypeptides may also include fused or hybrid polypeptides in 
which another polypeptide is fused at the N-terminus or the C- 
terminus of the polypeptide or fragment thereof. A fused 
polypeptide is produced by fusing a nucleic acid sequence (or a 
portion thereof) encoding one polypeptide to a nucleic acid 
sequence (or a portion thereof) encoding another polypeptide. 
Techniques for producing fusion polypeptides are known in the 
art, and include, ligating the coding sequences encoding the 
polypeptides so that they are in frame and expression of the 
fused polypeptide is under control of the same promoter (s) and 
terminator. The hybrid polypeptides may comprise a combination 
of partial or complete polypeptide sequences obtained from at 
least two different polypeptides wherein one or more may be 
heterologous to the mutant cell. 

An isolated nucleic acid sequence encoding a heterologous 
polypeptide of interest may be manipulated in a variety of ways 
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to provide for expression of the polypeptide. Expression will 
be understood to include any step involved in the production of 
the polypeptide including, but not limited to, transcription, 
post -transcriptional modification, translation, post- 

5 translational modification, and secretion. Manipulation of the 
nucleic acid sequence prior to its insertion into a vector may 
be desirable or necessary depending on the expression vector. 
The techniques for modifying nucleic acid sequences utilizing 
cloning methods are well known in the art. 

io "Nucleic acid construct" is defined herein as a nucleic 

acid molecule, either single- or double-stranded, isolated from 
a naturally occurring gene or modified to contain segments of 
nucleic acid that are combined and juxtaposed in a manner which 
would not otherwise exist in nature. The term nucleic acid 

is construct is synonymous with the term expression cassette when 
the nucleic acid construct contains all the control sequences 
required for expression of a coding sequence. The term "coding 
sequence" as defined herein is a sequence that is transcribed 
into mRNA and translated into a polypeptide. The boundaries of 

20 the coding sequence are generally determined by the ATG start 
codon located just upstream of the open reading frame at the 5' 
end of the mRNA and a transcription terminator sequence located 
just downstream of the open reading frame at the 3' end of the 
mRNA. A coding sequence can include, but is not limited to, 

25 genomic, cDNA, RNA, semisynthetic, synthetic, recombinant, or 
any combinations thereof. The coding sequence of the nucleic 
acid construct described herein may be the nucleotide sequence 
of the invention encoding a polypeptide with oxaloacetate 
activity (as defined in the section above entitled "Isolated 

30 Nucleic Acid Sequences Encoding Polypeptides Having 
Oxaloacetate Hydrolase Activity) in which case the nucleic acid 
construct is used for producing a polypeptide with oxaloacetate 
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activity as defined in said section or other manipulation 
involving said nucleic acid sequence; or the coding sequence 
may be one encoding a heterologous polypeptide to be produced 
in a mutant cell of the invention. 

The term "control sequences" is defined herein to include* 
all components that are necessary or advantageous for the . 
expression of a heterologous polypeptide. Each control 

sequence may be native or foreign to the nucleic acid sequence 
encoding the polypeptide. Such control sequences include, but 
are not limited to, a leader, polyadenylation sequence, 
propeptide sequence, promoter, signal peptide sequence, and 
transcription terminator. At a minimum, the control sequences 
include a promoter, and transcriptional and translational stop 
signals. The control sequences may be provided with linkers 
for the purpose of introducing . specific restriction sites 
facilitating ligation of the control sequences with the coding 
region of the nucleic acid sequence encoding a heterologous 
polypeptide. The term "operably linked" is defined herein as a 
configuration in which a control sequence is appropriately 
placed at a position relative to the coding sequence of the DNA 
sequence such that the control sequence directs the production 
of a heterologous polypeptide. 

The control sequence may be an appropriate promoter 
sequence, a nucleic acid sequence that is recognized by a cell, 
in particular a filamentous fungal cell for expression of the 
nucleic acid sequence. The promoter sequence contains 

transcriptional control sequences that mediate the expression 
of the heterologous polypeptide. The promoter may be any 
nucleic acid sequence that shows transcriptional activity in 
the cell including mutant, truncated, and hybrid promoters, and 
may be obtained from genes encoding extracellular or 
intracellular polypeptides either homologous or heterologous to 
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the cell. 

Examples of suitable promoters for directing the 
transcription of the nucleic acid constructs in a filamentous 
fungal cell in the methods of the present invention are 
5 promoters obtained from the genes encoding Aspergillus oryzae 
TAKA amylase, Rhizomucor miehei aspartic proteinase, 
Aspergillus niger neutral alpha-amylase, Aspergillus niger acid 
stable alpha-amylase, Aspergillus niger or Aspergillus awamori 
glucoamylase (glaA) , Rhizomucor miehei lipase, Aspergillus 

10 oryzae alkaline protease, Aspergillus oryzae triose phosphate 
isomerase, Aspergillus nidulans acetamidase, Aspergillus oryzae 
acetamidase (amdS) , Fusarium oxysporum trypsin-like protease 
(U.S. Patent No. 4,288,627), and mutant, truncated, and hybrid 
promoters thereof. Particularly preferred promoters are the 

15 NA2-tpi (a hybrid of the promoters from the genes encoding 
Aspergillus niger neutral alpha-amylase and Aspergillus oryzae 
triose phosphate isomerase) , glucoamylase, and TAKA amylase 
promoters . 

The control sequence may also be a suitable transcription 
20 terminator sequence, a sequence recognized by the host cell in 
question to terminate transcription. The terminator sequence 
is operably linked to the 3' terminus of the nucleic acid 
sequence encoding the heterologous polypeptide. Any terminator 
that is functional in the host cell may be used in the present 
25 invention. 

Preferred terminators functional in filamentous fungal 
cells are obtained from the genes encoding Aspergillus oryzae 
TAKA amylase, Aspergillus niger glucoamylase, Aspergillus 
nidulans anthranilate synthetase, Aspergillus niger alpha - 
30 glucosidase, and Fusarium oxysporum trypsin-like protease. 

The control sequence may also be a suitable leader 
sequence, a nontranslated region of a mRNA that is important 
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for translation by the cell. The leader sequence is operably 
linked to the 5' terminus of the nucleic acid sequence encoding 
the heterologous polypeptide. Any leader sequence that is 
functional in the cell may be used in the present invention. 

Preferred leaders are obtained from the genes encoding- 
Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose 
phosphate isomerase. 

The control sequence may also be a polyadenylation 
sequence, a sequence operably linked to the 3' terminus of the 
nucleic acid sequence and, when transcribed, is recognized by a 
cell as a signal to add polyadenosine residues to transcribed 
mRNA. Any polyadenylation sequence that is functional in the 
cell may be used in the present invention. 

Preferred polyadenylation sequences functional in 
filamentous fungal cells are obtained from the genes encoding 
Aspergillus oryzae TAKA amylase, Aspergillus niger 
glucoamylase, Aspergillus nidulans anthranilate synthase, and 
Aspergillus niger alpha-glucosidase . 

The control sequence may also be a signal peptide coding 
region that codes for an amino acid sequence linked to the 
amino terminus of the heterologous polypeptide and directs the 
encoded polypeptide into the cell's secretory pathway. The 5' 
end of the coding sequence of the nucleic acid sequence may 
inherently contain a signal peptide coding region naturally 
linked in translation reading frame with the segment of the 
coding region that encodes the secreted polypeptide. 
Alternatively, the 5' end of the coding sequence may contain a 
signal peptide coding region that is foreign to the coding 
sequence. The foreign signal peptide coding region may be 
required where the coding sequence does not naturally contain a 
signal peptide coding region. Alternatively, the foreign 
signal peptide coding region may simply replace the natural 
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signal peptide coding region in order to enhance secretion of 
the polypeptide. The signal peptide coding region may be 
obtained from a glucoamylase or an amylase gene from an 
Aspergillus species, or a lipase or proteinase gene from a 
5 Rhizomucor species. However, any signal peptide coding region 
that directs the expressed heterologous polypeptide into the 
secretory pathway of a cell may be used in the present 
invention. 

An effective signal peptide coding region in a filamentous 

10 fungal cell is the signal peptide coding region obtained from 
the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus 
niger neutral amylase, Rhizomucor miehei aspartic proteinase 
gene, and Humicola lanuginosa cellulase. 

The control sequence may also be a propeptide coding 

15 region that codes for an amino acid sequence positioned at the 
amino terminus of a polypeptide. The resultant polypeptide is 
known as a proenzyme or propolypept ide (or a zymogen in some 
cases) . A propolypeptide is generally inactive and can be 
converted to a mature, active polypeptide by catalytic or 

20 autocatalytic cleavage of the propeptide from the 
propolypeptide. The propeptide coding region may be obtained 
from the genes encoding Rhizomucor miehei aspartic proteinase 
and Myceliophthora thermophila laccase (WO 95/33836) . 

Where both signal peptide and propeptide regions are 

25 present at the amino terminus of a polypeptide, the propeptide 
region is positioned next to the amino terminus of the 
polypeptide and the signal peptide region is positioned next to 
the amino terminus of the propeptide region. 

The nucleic acid constructs may also comprise one or more 

30 nucleic acid sequences that encode one or more factors that are 
advantageous for directing the expression of the heterologous 
polypeptide, e.g., a transcriptional activator (e.g., a trans- 
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acting factor), chaperone, and processing protease. Any factor 
that is functional in the host cell, in particular in a 
filamentous fungal cell may be used in the present invention. 
The nucleic acids encoding one or more of these factors are not 
necessarily in tandem with the nucleic acid sequence encoding 
the heterologous polypeptide. 

It may also be desirable to add regulatory sequences that 
allow the regulation of the expression of the heterologous 
polypeptide relative to the growth of the cell. Examples of 
regulatory systems are those that cause the expression of the 
gene to be turned on or off in response to a chemical or 
physical stimulus, including the presence of a regulatory 
compound. The TAKA alpha-amylase promoter, Aspergillus niger 
glucoamylase promoter, and Aspergillus oryzae glucoamylase 
promoter may be used as regulatory sequences in filamentous 
fungal cells. Other examples of regulatory sequences are those 
that allow for gene amplification, e.g., the metallothionein 
genes which are amplified with heavy metals. In these cases, 
the nucleic acid sequence encoding the heterologous polypeptide 
would be operably linked with the regulatory sequence. 

The various nucleic acid and control sequences described 
above may be joined together to produce a recombinant 
expression vector that may include one or more convenient 
restriction sites to allow for insertion or substitution of the 
nucleic acid sequence encoding the heterologous polypeptide at 
such sites. Alternatively, the nucleic acid sequence encoding 
the heterologous polypeptide may be expressed by inserting the 
sequence or a nucleic acid construct comprising the sequence 
into an appropriate vector for expression. m creating the 
expression vector, the coding sequence is located in the vector 
so that the coding sequence is operably linked with the 
appropriate control sequences for expression, and possibly 
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secretion. 

The recombinant expression vector may be any vector (e.g., 
a plasmid or virus) that can be conveniently subjected to 
recombinant DNA procedures and can bring about the expression 
5 of the nucleic acid sequence encoding the heterologous 
polypeptide. The choice of the vector will typically depend on 
the compatibility of the vector with the cell into which the 
vector is to be introduced. The vector may be a linear or 
closed circular plasmid. The vector may be an autonomously 
10 replicating vector, i.e., a vector that exists as an 
extrachromosomal entity, the replication of which is 
independent of chromosomal replication, e.g., a plasmid, an 
extrachromosomal element, a minichromosome , or an artificial 
chromosome. The vector may contain any means for assuring 
is self -replication. Alternatively, the vector may be one that, 
when introduced into the cell, is integrated into the genome 
and replicated together with the chromosome (s) into which it 
has been integrated. The vector system may be a single vector 
or plasmid or two or more vectors or plasmids that together 
20 contain the total DNA to be introduced into the genome of the 
cell, or a transposon. - 

The vector preferably contains one or more selectable 
markers that permit easy selection of transformed cells. A 
selectable marker is a gene the product of which provides for 
25 biocide or viral resistance, resistance to heavy metals, 
prototrophy to auxotrophs, and the like. A selectable marker 
for use in a filamentous fungal host cell may be selected from 
the group including, but not limited to, amdS (acetamidase) , 
argB (ornithine carbamoyltransf erase ) , bar (phosphinothricin 
30 acetyltransf erase) , hygB (hygromycin phosphotransferase) , niaD 
(nitrate reductase), pyrG (orotidine-5 ' -phosphate 

decarboxylase) , sC (sulfate adenyltransf erase) , and trpC 
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(anthranilate synthase), as well as equivalents thereof. 
Preferred for use in a filamentous fungal cell are the amds and 
pyrG genes of Aspergillus nidulans or Aspergillus oryzae and 
the bar gene of Streptomyces hygroscopicus . 

The vector preferably contains an element (s) that permits- 
stable integration of the vector into a cell genome or 
autonomous replication of the vector in the cell independent of" 
the genome of the cell. 

"Introduction" means introducing a vector comprising the 
nucleic acid sequence into a cell so that the vector is 
maintained as a chromosomal integrant or as a self -replicating 
extra -chromosomal vector. Integration is generally considered 
to be an advantage as the nucleic acid sequence is more likely 
to be stably maintained in the cell. Integration of the vector 
into the chromosome occurs by homologous recombination, non- 
homologous recombination, or transposition. 

The introduction of an expression vector into a cell may 
involve a process consisting of protoplast formation, 
transformation of the protoplasts, and regeneration of the cell 
wall in a manner known per se . Suitable procedures for 
transformation of Aspergillus host cells are described in EP 
238 023 and Yelton et al . , 1984, Proceedings of the National 
Academy of Sciences USA 81: 1470-1474. A suitable method of 
transforming Fusarium species is described by Malardier et al . , 
1989, Gene 78: 147-156 or in WO 96/00787. 

For integration into the genome of a cell, the vector may 
rely on the nucleic acid sequence encoding the heterologous 
polypeptide or any other element of the vector for stable 
integration of the vector into the genome by homologous or 
nonhomologous recombination. Alternatively, the vector may 
contain additional nucleic acid sequences for directing 
integration by homologous recombination into the genome of the 
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cell. The additional nucleic acid sequences enable the vector 
to be integrated into the genome at a precise location (s) in 
the chromosome (s) . To increase the likelihood of integration 
at a precise location, the integrational elements should 
5 preferably contain a sufficient number of nucleic acids, such 
as 100 to 1,500 base pairs, preferably 400 to 1,500 base pairs, 
and most preferably 800 to 1,500 base pairs, which are highly 
homologous with the corresponding target sequence to enhance 
the probability of homologous recombination. The integrational 

10 elements may be any sequences that are homologous with the 
target sequence in the genome of the cell. Furthermore, the 
integrational elements may be non-encoding or encoding nucleic 
acid sequences . On the other hand, the vector may be 
integrated into the genome of the cell by non- homologous 

is recombination. 

For autonomous replication, the vector may further 
comprise an origin of replication enabling the vector to 
replicate autonomously in the filamentous fungal cell in 
question . 

20 It will be understood that the methods of the present 

invention are not limited to a particular order for obtaining 
the mutant cell. The modification of a gene involved in the 
production of a polypeptide with oxaloacetate hydrolase 
activity may be introduced into the parent cell at any step in- 

25 the construction of the cell for the production of a 
heterologous polypeptide. It is preferable that the cell has 
already been made oxaloacetate hydrolase-def icient using the 
methods of the present invention prior to the introduction of a 
gene encoding a heterologous polypeptide. 

30 The procedures used to ligate the elements described 

herein to construct the recombinant expression vectors are well 
known to one skilled in the art (see, e.g., J. Sambrook, E . F . 
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Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory 
Manual, 2d edition, Cold Spring Harbor, New York) . 

The present invention also relates to methods for 
obtaining oxaloacetate hydrolase-def icient mutant cells, in 
particular filamentous fungal mutant cells which comprise (a) • 
introducing into a parent cell, in particular a parent 
filamentous fungal cell a first nucleic acid sequence" 
comprising a modification of at least one of the genes involved 
in the production of a oxaloacetate hydrolase and a second 
nucleic acid sequence encoding a heterologous polypeptide; and 
(b) identifying the mutant from step (a) comprising the 
modified nucleic acid sequence, wherein the mutant cell 
produces less of the oxaloacetate hydrolase than the parent 
cell of the mutant cell when cultured under the same 
conditions . 

The present invention also relates to oxaloacetate 
hydrolase-def icient mutants of cells, in particular filamentous 
fungal cells for producing a heterologous polypeptide which 
comprise a first nucleic acid sequence comprising a 
modification of at least one of the genes involved in the 
production of a polypeptide with oxaloacetate hydrolase 
activity and a second nucleic acid sequence encoding the 
heterologous polypeptide, wherein the mutant produces less of 
the polypeptide with oxaloacetate hydrolase activity than the 
parent cell of the mutant cell when cultured under the same 
conditions . 

In another aspect of the present invention, the mutant 
cell may additionally contain modifications of one or more 
nucleic acid sequences which encode proteins that may be 
detrimental to the production, recovery, and/or application of 
the desired product such as the heterologous polypeptide of 
interest. The modification reduces or eliminates expression of 
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the one or more third nucleic acid sequences resulting in a 
mutant cell with a modified third nucleic acid sequence which 
may produce more of the heterologous polypeptide than the 
mutant cell without the modification of the third nucleic acid 
sequence when cultured under the same conditions. The third 
nucleic acid sequence may encode any protein or enzyme. For 
example, the enzyme may be an . aminopept idase , amylase, 
carbohydrase , carboxypept idase , catalase, cellulase, chitinase, 
cutinase , cyclodextrin glycosyl transferase , deoxyribonuclease , 
esterase , alpha - gal actos idase , beta - gal actos idase , 

glucoamylase , alpha-glucosidase , beta-glucosidase , invertase, 
laccase, lipase, mannosidase, mutanase, oxidase, a pectinolytic 
enzyme, peroxidase, phospholipase , phytase, polyphenoloxidase , 
proteolytic enzyme, ribonuclease , transglutaminase, and 
xylanase . The third nucleic acid sequence preferably encodes a 
proteolytic enzyme, e.g., an aminopept idase , a 

carboxypept idase , or a protease. 

In a further aspect, the present invention relates to a 
protein product essentially free from oxaloacetate hydrolase 
activity which is produced by a method of the present 
invention. 

The present invention also relates to nucleic acid 
constructs, recombinant expression vectors, and host cells 
containing the nucleic acid sequence of SEQ ID NO:l, 
subsequences or homologues thereof (as defined in the section 
entitled "Isolated Nucleic Acid Sequences Encoding Polypeptides 
Having Oxaloacetate Hydr.olase Activity"), for expression of the 
sequences. The constructs and vectors may be constructed as 
described herein. The host cell may be any cell suitable for 
the expression the nucleic acid sequence, in particular any of 
the cells mentioned in the section entitled "Isolated Nucleic 
Acid Sequences Encoding Polypeptides Having Oxaloacetate 
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Hydrolase Activity" as a source for the nucleic acid sequences 
of the invention, in particular, the host cell is a filamentous 
fungal cell, such as a cell of Aspergillus, such as A. niger 
or A. oryzae or a cell of Fusarium. . 

Fungal cells may be transformed by a process involving- 
protoplast formation, transformation of the protoplasts, and 
regeneration of the cell wall in a manner known per se." 
Suitable procedures for transformation of Aspergillus host 
cells are described in EP 238 023 and Yelton et al . , 1984, 
Proceedings of the National Academy of Sciences USA 81: 1470- 
1474. Suitable methods for transforming Fusarium species are 
described by Malardier et al . , 1989, Gene 78: 147-156 and WO 
96/00787. Yeast may be transformed using the procedures 
described by Becker and Guarente, In Abelson, J.N. and Simon, 
M.I., editors, Guide to Yeast Genetics and Molecular Biology, 
Methods in Enzymology, Volume 194, pp 182-187, Academic Press, 
Inc., New York; Ito et al . , 1983, Journal of Bacteriology 153 s 
163; and Hinnen et al . , 1978, Proceedings of the National 
Academy of Sciences USA 75: 1920. 

Methods of Production 

The present invention also relates to methods for 
producing a polypeptide encoded by a nucleotide sequence of the 
invention comprising (a) cultivating a host cell harbouring a 
nucleotide sequence of the invention under conditions suitable 
for production of the polypeptide; and (b) recovering the 
polypeptide . 

The present invention also relates to methods for 
producing a polypeptide of the present invention comprising (a) 
cultivating a host cell under conditions conducive for 
production of the polypeptide, wherein the host cell comprises 
a mutant nucleic acid sequence having at least one mutation in 
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the mature polypeptide coding region of SEQ ID NO:l, wherein 
the mutant nucleic acid sequence encodes a polypeptide which 
consists of amino acids 1 to 341 of SEQ ID NO : 2 , and (b) 
recovering the polypeptide. 

In the production methods of the present invention, the 
cells are cultivated in a nutrient medium suitable for 
production of the polypeptide using methods known in the art. 
For example, the cell may be cultivated by shake flask 
cultivation, small-scale or large-scale fermentation (including 
continuous, batch, fed-batch, or solid state fermentations) in 
laboratory or industrial fermenters performed in a suitable 
medium and under conditions allowing the polypeptide to be 
expressed and/or isolated. The cultivation takes place in a 
suitable nutrient medium comprising carbon and nitrogen sources 
and inorganic salts, using procedures known in the art. 
Suitable media are available from commercial suppliers or may 
be prepared according to published compositions (e.g., in 
catalogues of the American Type Culture Collection) . If the 
polypeptide is secreted into the nutrient medium, the 
polypeptide can be recovered directly from the medium. If the 
polypeptide is not secreted, it can be recovered from cell 
lysates . 

The polypeptides may be detected using methods known in 
the art that are specific for the polypeptides. These 
detection methods may include use of specific antibodies, 
formation of an enzyme product, or disappearance of an enzyme 
substrate. For example, an enzyme assay may be used to 
determine the activity of the polypeptide as described herein. 

The resulting polypeptide may be recovered by methods 
known in the art. For example, the polypeptide may be 

recovered from the nutrient medium by conventional procedures 
including, but not limited to, centrif ugation, filtration, 
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extraction, spray-drying, evaporation, or precipitation. 

The polypeptides may be purified by a variety of 
procedures known in the art including, but not limited to, 
chromatography (e.g., ion exchange, affinity, hydrophobic, 
chromatofocusing, and size exclusion), electrophoretic 
procedures (e.g., preparative isoelectric focusing), 
differential solubility (e.g., ammonium sulfate precipitation), 
SDS-PAGE, or extraction (see, e.g., Protein Purification, J.-c. 
Janson and Lars Ryden, editors, VCH Publishers, New York, 
1989) . 

Uses 

The present invention is also directed to methods of using 
the polypeptides having oxaloacetate hydrolase activity. 

The polypeptides of the present invention may be used as a 
diagnostic enzyme, e.g. for the detection of oxalic acid in 
food or other products . 

The nucleotide sequences of the present invention may be 
used for modification of the production of oxaloacetate 
hydrolase and thus oxalic by a cell, such as a microbial cell 
normally producing the hydrolase. In particular, the nucleotide 
sequences may be used to reduce or eliminate oxaloacetate 
hydrolase and thus oxalic acid production of the cell in 
question. 

The present invention is further described by the 
following examples which should not be construed as limiting 
the scope of the invention. 



Examples 



Chemicals used as buffers and substrates were commercial 
products of at least reagent grade. 
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Strains: 

Aspergillus niger BO 1 (DSM 12665) 
5 Escherichia coli DH 5a D. M . Woodcock et al . (1989) Nucleic 
Acids Res. 17, 3469-3478 

Media and assays 

10 Buffer A: 50 mM Tris/HCl pH 7.5, 2mM MnCl 2 , 20 mM DTT, 5 % 
sucrose . 

Oxaloacetate hydrolase activity assay: 1000 ml . 0 . 1 M MOPS, pH 
7.5, 2 mM MnCl 2 , 2 5 ml 4 0 mM oxaloacetate, 5 to 10 0 ml sample. 

15 The absorbance was measured at 2 55 nm. The activity was 
determined from the rate of decrease in absorbance and the 
absorption coefficient of oxaloacetate (1.1 mM" 1 cm" 1 , Lenz et 
al , Partial purification and some properties of oxalacetase 
from Aspergillus niger. 1976. Eur. J . Biochem. 65:225-236). The 

20 assay was carried out at 30°C. 

Protein assay: Protein concentrations were measured by using 
the BioRad (Hercules, CA, USA) Bio-Rad Protein Assay ) cat. No. 
500-00 06 following the manufacturers instructions and with 
25 bovine serum albumine as a standard. 



Example 1 

Purification of the Oxaloacetate hydrolase (EC 3.7.1.1) 

Aspergillus niger B01 was fermented in shakeflasks at 3 0°C 
in the medium: sucrose 20 g/L, NaN0 3 15 g/L, KH 2 P0 4 1.5 g/L, 
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MgS0 4/ 7H 2 0 1 g/L, NaCl 1 g/L, CaCl 2/ 2H 2 0 0.1 g/L, trace 
solution 0.5 ml/L. Trace solution: ZnSG 4 , 7H 2 0 14.3 g/L, CuS0 4 , 
5H 2 0 2.5 g/L, NiCl 2 , 6H 2 0 0.5 g/L, FeS0 4 , 7H 2 0 13.8 g/L, MnCl 2 6 
g/L. The pH was 2.5 until a biomass concentration of about 0.5 
g/L was reached. Then the pH was shifted to 6 by addition of 2 
M NaOH and the cells were grown until the biomass concentration 
reached about 5 g/L. The cells were harvested by filtration, 
washed with 0.9% (w/v) NaCl and then frozen in liquid nitrogen. 
The frozen cells were disrupted in a morter under liquid 
nitrogen and then suspended in buffer A. The suspension was 
centrifuged (15 min, 40.000 x g, 4°C) , and the supernatant was 
isolated. Ammoniumsulphate was added to 45% saturation (277 
g/1) , and the formed suspension was centrifuged (15 min, 40.000 
x g, 4°C) , and the supernatant was discharged. The pellet was 
dissolved in buffer A and filtered through a 0.45 Jim filter. 

Ammonium sulfate was added to the sample to a 
conductivity of 50 mS/cm. The sample was then applied onto a 
Phenyl Sepharose High Performance column (Amersham Pharmacia 
Biotech, Uppsala) equilibrated with 0.5 M ammonuim sulfate 
dissolved in buffer A, following the manufacturers 
instructions. The column was washed with the same buffer and 
then eluted using a linear salt gradient from 0.5 M Ammonium 
sulfate dissolved in buffer A to pure buffer A. 

Fractions having oxaloacetate hydrolase activity were 
pooled. This solution was diluted with 2mM MnCl 2 , 2 0 mM DTT, 5 
% sucrose to a conductivity of 4 mS/cm and then applied to a Q 
sepharose High Performance column (Amersham Pharmacia Biotech, 
Uppsala) equilibrated with buffer A. The column was washed with 
buffer A and then eluted using a linear salt gradient from 0 M 
to 0.5 M NaCl dissolved in buffer A. Fractions having 
oxaloacetate hydrolase activity were pooled. 

The samples were applied to PD-10 columns (Amersham 
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Pharmacia Biotech, Uppsala) equilibrated with 10 mM NaH 2 P0 4 pH 
7.2, 0.1 mM MnCl 2 , 5% sucrose, 10 mM DTT and eluted with the 
same buffer. The samples were applied to a Econo-Pac HTP column 
(BioRad, Hercules, CA, USA) that was equilibrated with 10 mM 
NaH 2 P0 4 pH 7.2, 0 . 1 mM MnCl 2 , 5% sucrose, 10 mM DTT. The column 
was washed with 10 mM NaH 2 P0 4 pH 6.8, 0.1 mM MnCl 2 , 5% sucrose, 
10 mM DTT and then eluted with a linear gradient from 10 mM 
NaH 2 P0 4 pH 6.8, 0.1 mM MnCl 2 , 5% sucrose, 10 mM DTT to 400 mM 
NaH 2 P0 4 pH 6.8, 0 . 1 mM MnCl 2 , 5% sucrose, 10 mM DTT. The samples 
having oxaloacetate hydrolase activity were used for SDS-PAGE. 

Example 2 

.Protein sequence determination . of fragments of the 
oxaloacetate hydrolase * * 

The purified oxaloacetate hydrolase of example 1 was subjected 
to SDS- PAGE gel electrophoresis on a 12.5 % gel using standard 
methods (Laemmli, U. K. 1970. Cleavage of structural proteins 
during the assembly of the head of bacteriophage T4 . Nature. 
227:680-685). The proteins were blotted onto a PVDF membrane by 
electroblotting, and the membrane was stained with Coomassie 
brilliant blue R-250 as described by Plough et al (Plough M, 
Jensen AL and Barkholt V. 1989: Anal Biochem. 181, 33-39). 4 
protein bands in the range of an apparent molecular weight of 
38 to 40 kD were excised and applied for amino terminal 
sequencing using a Procise -494 protein sequencer from Perkin 
Elmer - Applied Biosys terns, Norwalk, CT, USA following the 
manufacturers instructions. The following sequences were 
obtained : 



1. MKVDTPDSASTISMTN (SEQ ID NO : 3) 

2. TNT I T I TVEQDG I YE (SEQ ID NO: 4) 
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3. VEQDGIYEIN (SEQ ID NO: 5) 

4. GARQEPWNLNMVTG (SEQ ID NO: 6) . 



Example 3 

Cloning of the gene encoding the oxaloacetate hydrolase 

The protein sequences determined in example 2 were used for 
database searches, and an Aspergillus niger EST sequence was 
retrieved (EMBL, T82752, AN752). This EST encoded a protein 
sequence that could be aligned to the four sequences from 
example 2 with 100 % identity. The EST sequence was used to 
design the following PGR primers: 

Oxalac EST sense: 5' AAA GTT GAT ACC CCC GAT TCT 3' (SEQ ID NO: 
7) 

Oxalac EST antisense: 5' ATG GCA ATA CGG GGA CAG ACC 3' (SEQ 
ID NO: 8) . 



These primers were used to amplify DNA from A. niger BO 1 
prepared essentially as described by Leach et . al . (J. Leach, 
D. B. Finkelstein and J. A. Rambosek (1986) Fungal gent, newsl 
33, 32-33) using amplitaq taq polymerase from Perkin Elmer 

(Norwalk, CT, USA) following the manufacturers instructions. 
The PCR reaction was run in an MJ PCT 150 capillary PCR cycler 

(MJ research, Watertown, MA, USA in a volume of 10 (il running 
30 cycles with a denaturation temperature of 94°C for 10 sec, 
an annealing temperature of 55°C for 10 sec and an elongation 
temperature of 72°C for 30 sec with a temperature gradient from 
denaturation to annealing of -0.5°C/sec. 

The obtained 219 bp fragment was ligated into the pCR II 
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vector (Invitrogen, Carlsbad, CA, USA) following the 
manufacturers instructions and transformed into Escherichia 
coli DH 5a as described by Sambrook (J. Sambrook et al . (1989) 
Molecular cloning, a laboratory manual, second edition, Cold 
5 Spring Harbor Laboratory Press) . A clone containing an insert 
of the expected size was sequenced using the primers: 

M13 reverse primer: 5' CAG GAA ACA GCT ATG AC 3' (SEQ ID 

NO: 9) . 

io Ml 3 Forward (-20) primer: 5' GTA AAA CGA CGG CCA G 3' (SEQ 

ID NO: 10). 

The DNA sequence analysis was made using an ABI PRISM 377 
DNA Sequencer together with the ABI PRISM BigDye Terminator 

15 Cycle Sequencing Ready Reaction Kit (Perkin Elmer Applied 
Biosystems, Foster city, CA, USA) following the manufacturers 
recommendations. The sequences confirmed the fragment to be the 
desired sequence. 

A restriction map was established by Southern blot analysis 

20 using the 219 ~ bp insert as a probe following the protocols of 
Sambrook et al . From this restriction map it was concluded that 
the gene was located on a 5 kb Bgl II fragment. 

The gene was then cloned by inverse PCR: Genomic DNA from A. 
niger BO 1 was digested with Bgl II, and fragments ranging from 

25 4 kb . to 6 kb were recovered from an agarose gel. The DNA was 
ligated using T4 DNA ligase at 16 °C. This ligation mixture was 
used as template in a PCR reaction using the following primers: 

OXEST1 5' GAC GGT CTG TCG CCG TAT TGC 3' (SEQ ID NO: 11) 

30 OXEST2 5' GGA AGC AGA ATC GGG GGT ATC 3' (SEQ ID NO: 12) . 



The Expand high fidelity polymerase (Boehringer Mannheim, 
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Mannheim, Germany) was used for the amplification following the 
manufacturers recommendations. The MgCl2 concentration in the 
reaction was 1 mM. Otherwise the conditions were as above. A 5 
kb fragment was generated. This fragment was digested with Bgl 
II, and the digested fragment was ligated to pUC 19 (c- 
Yanisch-Perron et al (1985) Gene 33, 103-119) digested with 
BamH I and Hinc II. The ligation mixture was transformed into' 
E. coli DH 5a. Colonies having insert sizes of 1.3 kb and 3.7 
kb were found. One colony of each class were sequenced using 
the M13 reverse primer and the M13 Forward (-20) primer. In 
this way the Bgl II neighboring sequences were determined, and 
the following primers were designed to PCR amplify the 5 kb Bgl 
II fragment: 

Flank 1: 5' GCG GCC GCG CGC CAA TAA CGT CCG ATT C 3' (SEQ ID 
NO: 13) 

Flank 2: 5' GCG GCC GCC TAC AAA TAC ATT GAC CTC CC 3' (SEQ ID 
NO: 14) . 



These primers were used to amplify genomic A. niger BO 1 DNA 
using the same PCR conditions as for the inverse PCR. The 
generated 5 kb fragment was ligated into pCR II to form pHP I. 
The oxaloacetate hydrolase gene was sequenced using pHP I as 
template with the following primers: 

Oxalac EST sense: 5' AAA GTT GAT ACC CCC GAT TCT 3' (SEQ ID 
NO: 15) 
Oxalac EST 

antisense: 5' ATG GCA ATA CGG CGA CAG ACC 3' (SEQ ID 

NO : 16) 

OXEST 1: 5 ' GAC GGT CTG TCG CCG TAT TGC 3' (SEQ ID 

NO : 17) 
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OXEST 2 : 
NO: 18) 
OXEST 3 : 
NO : 19) 
OXEST 4 : 
NO: 20) 
0X5 : 
NO : 21) 
OX6 : 
NO: 22) 
OX7 : 
NO: 23) 
OX8 : 
NO: 24) 
OX9 : 

ID NO: 2 5) 
OX10 : 
NO: 26) 
OX11 : 
NO: 27) 
0X12 : 
NO : 2 8) 
OX13 : 
NO : 2 9) 
OX14 : 
NO : 3 0) 
OX15: 
NO : 31) 
FLANKEl : 

(SEQ ID NO: 32) 

FLANKE2 : 

3' (SEQ ID NO: 33) 
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5' GGA AGC AG A ATC GGG GGT ATC 3' (SEQ ID 

5' GCC GGA GTC GCG GGA TTC CAC 3' (SEQ ID 

5' GGC GGA CTA TGA TTT GTG CC 3' ( SEQ ID 

5' TGA TGG TCG CCC GTT CCG TT 3' (SEQ ID 

5' TGC CAT TCA ATT TTC TTG GCC 3' (SEQ ID 

5' TGA TCT TCG ATG TGG AAT CCC 3' (SEQ ID 

5' GAT GGC GTC GAT TGA CCA TTT 3' (SEQ ID 

5' GGA GAT GGG TTT GCT AAT GGT GTT 3' (SEQ 

5' TTA GCA AAC CCA TCT CCA CC 3' (SEQ ID 

5' CGA ATT ACT GGT CAT TAG CCC 3' (SEQ ID 

5' CGA GAG AAG TAT TCT AGA CCC 3' (SEQ ID 

5' TGA CTG TCG ATC AGG GTG TT 3' (SEQ ID 

5' GTG TGC GGA TTG ATG GAC TC 3' (SEQ ID 

5' CAA CCC AAC TCA ACA ACT CT 3' (SEQ ID 

5' GCG GCC GCG CGC CAA TAA CGT CCG ATT C 3' 

5' GCG GCC GCC TAC AAA TAC ATT GAC CTC CC 
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The DNA sequence is shown in SEQ ID NO 1. The sequence was 
analyzed for coding sequence and for the presence of introns 
using the computer software Netgene 2 (S.M. Hebsgaard, P.G. 
Korning, N. Tolstrup, J. Engelbrecht, P. Rouze, S. Brunak 
(1996. Nucleic Acids Research 24: 3439-3452) suggested the 
existence of 3 exons (see annotations to SEQ ID NO 1) The 341 
residue protein sequence deduced from these 3 introns is shown 
in SEQ ID NO 2 . 

Data base searches using this protein sequence (oah) as the 
query sequence revealed sequence homology to Isocitrate lyase 
from Aspergillus nidulans (Swiss prot P28298) and Neurospora 
crassa (Swiss prot P28299) as well as carboxyvinyl - 
carboxyphosphonate phosphorylmutase from Streptomyces 

hygroscopicus (Swiss prot P11435) and a hypothetical protein 
from Bacillus subtilis (Swiss prot P54528) 



oah P28298 P28299 



oah 


100. 


0 


20 


.2 


21 . 


0 


P28298 


20. 


2 


100 


. 0 


74 . 


3 


P28299 


21. 
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74 


.3 


100. 


0 


P11435 
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.3 


18 . 
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A strain of E. coli harbouring 
as DSM 12660. 



P11435 


P54528 


29 . 7 


29 . 1 


17 . 6 


19.3 


18 . 1 


18 . 6 


100 . 0 


36.5 


36.5 


100.0 



the plasmid pHPl was deposited 



Example 4 

Disruption of the oxaloacetate hydrolase gene 

pHP I was digested with Nru I and BstE II and the 6.6 kb 
fragment was isolated. A plasmid harboring the Aspergillus 
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niger pyrG gene, pJRoy 10 (figure 1), was digested with Nco I, 
and the 5' recessive ends were then filled in with the Klenow 
fragment of DNA polymerase I , and then digested with BstE II. 
The 2948 bp fragment was isolated and ligated to the pHP I 
5 fragment. The ligation mixture was used to transform E. coli 
DH5a . A colony harboring the desired plasmid was identified, 
and the plasmid was termed pHP 3 (figure 2) . A strain of E. 
coli harbouring the plasmid pHP3 was deposited as DSM 12661. 
The pyrG deficient A. niger BO 1 derivative JRoy3 (BOl was 

10 rendered pyrG negative by transforming with a deletion fragment 
of the pyrG gene region and selecting on FOA) was transformed 
with the 5 kb EcoR I - Not I fragment of pHP 3 using the method 
described in the EP patent EP 0 531 372 Bl . 275 transf ormants 
were reisolated twice and then grown up in 96 well microtiter 

15 dishes at . 34°C, pH 6.0 for 48 hr in the medium glucose 20 g/L, 
NaN0 3 15 g/L, KH 2 P0 4 1.5 g/L, MgS0 4 , 7H 2 0 1 g/L, NaCl 1 g/L, 
CaCl 2 , 2H 2 0 0.1 g/L, tracer solution 0.5 ml/L. Tracer solution: 
ZnS0 4 , 7H 2 0 14.3 g/L, CuS0 4 , 5H 2 0 2.5 g/L, NiCl 2 , 6H 2 0 0.5 g/L, 
FeSO,, 7H 2 0 13.8 g/L, MnCl 2 6 g/L. The supernatants were assayed 

20 for oxalate using the Sigma (St. Louis, MO, USA) oxalate (cat 
no 591-C) kit. 8 transf ormants were found not to produce 
oxalate. 6 of those were subjected to Southern blot analysis 
together with 2 oxalate producing transf ormants , BO 1 and JRoy 
3 as positive controls. The 219 bp EST PCR fragment described 

25 in example 3 was used as probe. A seperate blot of the same 
samples were probed with the Nco I - BstE II fragment of pJRoy 
10. The Southern blots revealed that the 6 oxalate negative 
strains were disrupted in the oxaloacetate gene, whereas the 
positive controls contained intact oxaloacetate genes. 

30 

Example 5 

Fermentation of an oxalate negative strain 
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One of the oxalate negative strains were grown in a batch 
fermenter equipped with stirring, temperature control, pH 
control and aeration. The medium was glucose 16 g/L, (NH4) 2 S0 4 
7.5 g/L, KH 2 P0 4 1.5 g/L, MgSO, , 7H 2 0 1 g/L, NaCl 1 g/L, CaCl 2 , 
2H 2 0 0.1 g/L, trace solution 0.5 ml/L. Trace solution: ZnS0 4 , 
7H 2 0 14.3 g/L, CuS0 4 , 5H 2 0 2.5 g/L, NiCl 2 , 6H 2 0 0.5 g/L, FeS0 4 , 
7H 2 0 13.8 g/L, MnCl 2 6 g/L. The pH was 2.5 until the biomass 
had reached a concentration of 0.5 g/L. At this point the pH 
was shifted to pH 6.0. The fermentation broth was analyzed for 
oxalate, citrate, pyruvate, succinate, acetate, glycerol, 
ethanol and glucose (Nissen, T. L., U. Schulze, J. Nielsen, and 
J. Villadsen. 1997. Flux distributions in anaerobic, glucose- 
limited continuous cultures of Saccharomyces cerevisiae. 
Microbiol. 143:203-218). Of bi-products only citrate, pyruvate, 
succinate and glycerol could be detected and the maximum 
concentrations of these bi-products were 0.74 g/L, 0.18 g/L, 
0.14 g/L and 0.13 g/L respectively. The biomass yield on 
glucose was 0.58 g/g and the specific growth rate was 0.23 h" 1 . 

Deposit of Biological Material 

The following biological material has been deposited under 
the terms of the Budapest Treaty with the DSMZ-Deutsche 
Sammlung von Mikroorganismen und Zellkulturen GmbH, Mascheroder 
Weg lb, D-38124 Braunschweig, Germany, and given the following 
accession numbers: 



Depositor's ref: 
NN049454 
NN049455 
NN049047 



Accession Number 
DSM 12660 
DSM 12661 
DSM 12665 



Date of Deposit 
1999-02-03 
1999-02-03 
1999-02-03 
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The strain has been deposited under conditions that assure 
that access to the culture will be available during the 
pendency of this patent application to one determined by the 
Commissioner of Patents and Trademarks to be entitled thereto 
5 under 37 C.F.R. §1.14 and 35 U.S.C. §122. The deposit 

represents a substantially pure culture of the deposited 
strain. The deposit is available as required by foreign patent 
laws in countries wherein counterparts of the subject 
application, or its progeny are filed. However, it should be 

10 understood that the availability of a deposit does not 
constitute a license to practice the subject invention in 
derogation of patent rights granted by governmental action. 

The invention described and claimed herein is not to be 
limited in scope by the specific embodiments herein disclosed, 

is since these embodiments are intended as illustrations of 
several aspects of the invention. Any equivalent embodiments 
are intended to be within the scope of this invention. Indeed, 
various modifications of the invention in addition to those 
shown and described herein will become apparent to those 

20 skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of the 
appended claims. In the case of conflict, the present 

disclosure including definitions will control. 

Various references are cited herein, the disclosures of 

25 which are incorporated by reference in their entireties. 
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Claims 



1. An isolated nucleic acid sequence encoding a polypeptide 
having oxaloacetate hydrolase activity, selected from the group 
consisting of : 

(a) a nucleic acid sequence encoding a polypeptide having 
an amino acid sequence which has at least 65% identity with 
amino acids 1 to 341 of SEQ ID NO: 2; 

(b) a nucleic acid sequence having at least 65% homology 
with a nucleotide sequence constituted by nucleotides 1157- 
1411, 1504-1651 and 1764-2383 of SEQ ID NO : 1 ; 

(c) a nucleic acid sequence which hybridizes under medium 
stringency conditions with (i) the nucleic acid sequence of SEQ 
ID NO:l, (ii) the cDNA sequence of SEQ ID NO : 1 , (iii) a 
subsequence of (i) or (ii) of at least 100 nucleotides, or (iv) 
a complementary strand of (i) , (ii) , or (iii) ; 

(d) an allelic variant of (a) , (b) , or (c) ; 

(e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has 
oxaloacetate hydrolase activity; 

2. The nucleic acid sequence of claim 1, which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID NO: 2. 

3. The nucleic acid sequence of claim 1 which has at least 
65% homology with the nucleic acid sequence of SEQ ID N0:1. 

4. The nucleic acid sequence of claim 1, which has the 
nucleic acid sequence of SEQ ID NO : 1 . 

5. The nucleic acid sequence of claim 1, which is contained 
in the plasmid pHP 1 which is contained in Escherichia coli DSN! 
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12660. 

6. An isolated nucleic acid sequence produced by (a) 
hybridizing a DNA under medium stringency conditions with (i) 
the nucleic acid sequence of SEQ ID NO : 1 , (ii) the cDNA 
sequence of SEQ ID NO : 1 , (iii) a subsequence of (i) or (ii) of 
at least 100 nucleotides, or (iv) a complementary strand of 
(i) , (ii) , or (iii) ; and (b) isolating the nucleic acid 
sequence . 

7. A nucleic acid construct comprising the nucleic acid 
sequence of any of claims 1-6 operably linked to one or more 
control sequences which direct . the production of the 
polypeptide in a suitable expression host. 

8. A recombinant expression vector comprising the nucleic 
acid construct of claim 7, a promoter, and transcriptional and 
translational stop signals. 

9. A recombinant host cell comprising the nucleic acid 
construct of claim 7 . 

10. A method for producing a polypeptide having oxaloacetate 
hydrolase activity comprising (a) cultivating the host cell of 
claim 9 under conditions suitable for production of the 
polypeptide; and (b) recovering the polypeptide. 

11. A method for producing a mutant of a cell, which 
comprises disrupting or deleting the nucleic acid sequence of 
any of claims 1-6 or a control sequence thereof, which results 
in the mutant producing less of the oxaloacetate hydrolase than 
the cell . 
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12. The method of claim 11, comprising: 

(a) introducing into a parent cell, in particular a parent 
filamentous fungal cell a nucleic acid sequence comprising a 
modification of at least one of the genes responsible for the 
production of a oxaloacetate dehydrolase ; and 

(b) identifying the mutant from step (a) , wherein the mutant 
produces less of the oxaloacetate dehydrolase than the parent 
cell of the mutant cell when cultured under the same 
conditions . 

13. A mutant cell produced by the method of any of claims 11 
or 12 . 

14. The mutant according to claim 13 which is a filamentous 
fungal cell. 

15. The mutant according to claim 14 which is a cell of 
Aspergillus, such as A. niger, Acremonium, Aureobasidium, 
Cryptococcus, Filibasidium, Fusarium, Gibberella, Humicola, 
Magnaporthe, Mucor, Myceliophthora, Myrothecium, 
Neocallimastix, Neurospora, Paecilomyces , Penicillium, 
Piromyces, Schizophyllum, Talaromyces, Thermoascus, Thielavia, 
Tolypocladium, or Trichoderma strain. 

16. The mutant according to any of claims 11-15, wherein the 
mutant cell produces at least about 25% less of the 
oxaloacetate hydrolase or oxalic acid than the parent cell when 
cultured under identical conditions. 

17. The mutant of claim 16, wherein the mutant cell produces 
no oxaloacetate hydrolase or oxalic acid. 
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18. The mutant of any of claims 11-17, wherein the mutant cell 
further comprises one or more modifications of one or more 
third nucleic acid sequences, wherein the modification reduces 
or eliminates expression of the one or more third nucleic acid 
sequences . 

19. The mutant of claim 18, wherein the third nucleic acid 
sequence encodes an enzyme selected from the group consisting 
of an aminopeptidase , amylase, carbohydrase , carboxypept idase , 
catalase, cellulase, chitinase, cutinase, cyclodextrin 
glycosyltransf erase , deoxyribonuclease , esterase , alpha - 
galact os idase , beta -gal actos idase , glucoamylase , alpha - 
glucosidase, beta-glucosidase , invertase, laccase, lipase, 
mannosidase, mutanase, oxidase, a pectinolytic enzyme, 
peroxidase, phytase, polyphenoloxidase , proteolytic enzyme, 
ribonuclease, transglutaminase, and xylanase. 

20. The mutant of any of claims 11-19, wherein the mutant cell 
further comprises a modification of one or more genes which 
encode a protease. 



21. A method for producing a homologous product comprising (a) 
cultivating the mutant of any of claims 11-20 under conditions 
conducive for production of the product, and (b) recovering the 
product . 

22. The method according to claim 21, wherein the homologous 
product is citric acid. 

23. The mutant of any of claims 11-20, which further comprises 
a nucleic acid sequence encoding a heterologous polypeptide. 
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24. A method for producing a heterologous polypeptide 
comprising (a) cultivating the mutant of claim 23 under 
conditions conducive for production of the polypeptide; and (b) 
recovering the polypeptide. 

25. The method according to claim 24, wherein the heterologous 
polypeptide is a hormone, a hormone variant, an enzyme, a 
receptor or a portion thereof, an antibody or a portion 
thereof, or a reporter. 

26. The method of claim 25, wherein the enzyme is an 
oxidoreductase, a transferase, a hydrolase, a lyase, an 
isomerase, or a ligase, preferably an aminopeptidase, amylase, 
carbohydrase , carboxypept idase , catalase, cellulase, chitinase, 
cutinase, cyclodextrin glycosyltransf erase , deoxyribonuclease , 
esterase, alpha-galactosidase , beta-galactosidase , 
glucoamylase, alpha-glucosidase , beta-glucosidase , invertase, 
laccase, lipase, mannosidase, mutanase, oxidase, a pectinolytic 
enzyme, peroxidase, phytase, polyphenoloxidase , proteolytic 
enzyme, ribonuclease , transglutaminase, or xylanase . 

27. Use of an oxaloacetate dehydrolase encoded by a nucleic 
acid sequence according to any of claims 1-6 as a diagnostic 
enzyme . 

28. A substantially pure culture of E. coli DSM 12660 or DSM 
12661 . 
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Construction of pHP 3. 



Fig. 2 
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SEQUENCE LISTING 



<110> Novo Nordisk A/S 

<120> 

<130> 

<160> 33 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 3494 
<212> DNA 

<213> Aspergillus niger 

<22 0> 
<221> CDS 

<222> (1157) . . (1411) 
<220> 

<221> intron 

<222> (1412) . . (1503) 

<220> 

<221> intron 
<222> (1) . . (1156) 

<220> 
<221> CDS 

<222> (1504) . . (1651) 

<220> 
<221> CDS 

<222> (1764) . . (2383) 
<400> 1 

tgctccctcg gccaagcgcc aataacgtcc gattcatccc attcctcgtc cagctggcga 60 
actccggagg ttgattgctc gctcgctctc agttggccac caaacttact cgtccccctc 12 0 
cttcaccctc cctcctctgc caatgctaca gagtacttgg ctaggctact atcttctcag 180 
ct 999tgaag aacaacgggc cccgtgcgtg atgagcaaaa gcgtctgaca tgcagcaact 24 0 
gcagtatact ggagcccgcg gctaccgagg aactcgtgct cgtgtgccac cacatcgaag 300 
tgagttgatg cgtcttgtcc atgcagtgtc ggcgtggcct aaagtacggg ccaaacctgt 360 
ctgacttcat cccacactat taccccctcc ctcattctcc cctgattcgg cccaataagg 420 
aaatcactta gtcaatcaat cctgccatta ccggcgcgta atctgaaact acgcgcggac 4 80 
tgtctcttac tcccctcgcg gtgggcggcc cagccagccc catccttact agatttagcg 540 
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aattactggt cattagccct gtacggggga ggggcgggaa aacaaaaatg cgaataatag 6 00 

aataaattta ataaagaaaa aagagggggg gggagcttat ctaggcccct gctgcattgc 660 

attcggacat ttttcgactt gtcacaggca caaatcatag tccgccgatg gcgtcgattg 720 

accattttct tttcttttct cggcgctggg atggtggcca agaaaattga atggcaatgg 780 

ttcgttcacc ggagtagggt gtacgtgcat tgtgtggatt gacgatgatt ctcggccaag 84 0 

ggcttgcgtt gcaatcccac caggagggga atgttgcaga cagacagaaa gcaaaagaag 900 

tattggaggg aaaaaaacaa ttcttgaaaa atgatcttct caggtaatga atattggttg 960 

ctggcgggct gatcttctcc cgacacgtct atataaactg gtcaccttct ggcccttcct 1020 

ttctatctct tccttctcat catcagtctc aaacaagcct ctttctctcc taccttcact 1080 

ctccactttc tcctttcgaa agggataaaa ctctcctcct cattctcacc tatatatacc 1140 

ttgtgctttt ctcgca atg aaa gtt gat acc ccc gat tct get tec acc ate 1192 

Met Lys Val Asp Thr Pro Asp Ser Ala Ser Thr lie 
1 5 10 

age atg acc aac act ate acc ate acc gta gag cag gac ggt ate tat 1240 
Ser Met Thr Asn Thr lie Thr lie Thr Val Glu Gin Asp Gly lie Tyr 
15 20 25 

gag ate aac ggt gee cgt caa gag ccc gtg gtc aac ctg aac atg gtc 1288 
Glu lie Asn Gly Ala Arg Gin Glu Pro Val Val Asn Leu Asn Met Val 
30 35 40 

acc ggt gcg age aaa ctg cgc aag cag ctt cgc gag acc aat gag ttg 1336 
Thr Gly Ala Ser Lys Leu Arg Lys Gin Leu Arg Glu Thr Asn Glu Leu 
45 50 55 60 

etc gtg tgt cct ggt gtg tac gac ggt ctg tec gee cgt att gee ate 1384 
Leu Val Cys Pro Gly Val Tyr Asp Gly Leu Ser Ala Arg lie Ala He 

65 70 * 75 

aac ctg ggc ttc aag ggc atg tac atg gtatgttgga ttccttagac 1431 
Asn Leu Gly Phe Lys Gly Met Tyr Met 
80 * 85 

tacctttccc cacagtcaac acttctccgc ttccgcgatg gagaaaaaag atcatactaa 1491 

eggaaaggtc ag acc ggc gee ggt act acc gcg tct aga ctg ggc atg gee 1542 
Thr Gly Ala Gly Thr Thr Ala Ser Arg Leu Gly Met Ala 
90 95 

gat ctg ggt eta gee cac ate tac gac atg aag acc aac gca gag atg 1590 
Asp Leu Gly Leu Ala His He Tyr Asp Met Lys Thr Asn Ala Glu Met 
100 105 110 

ate gcg aac ctg gac ccc tac ggt cct ccc ctg ate gca gac atg gac 1638 
He Ala Asn Leu Asp Pro Tyr Gly Pro Pro Leu He Ala Asp Met Asp 
115 120 125 ' 130 
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act ggc tac gga g gtgagaatcc cccatctcca ctgtctgcca agacataatg 1691 
Thr Gly Tyr Gly G 

atctacccgc gccaaaaagc aaaacggcaa tatagaccca gttccccact aacaccaaaa 1751 

aaacaaaaat ag gc ccc ctg atg gtc gcc cgt tec gtt caa caa tac ate 1801 
ly Pro Leu Met Val Ala Arg Ser Val Gin Gin Tyr lie 
135 140 145 

caa gcc gga gtc gcg gga ttc cac ate gaa gat cag ate caa aac aag 184 9 
Gin Ala Gly Val Ala Gly Phe His lie Glu Asp Gin lie Gin Asn Lys 
150 155 160 

cga tgc gga cac ctg gca ggc aag cgc gtc gtc ace atg gac gaa tac 1897 
Arg Cys Gly His Leu Ala Gly Lys Arg Val Val Thr Met Asp Glu Tyr 
165 170 175 

ttg act cgc ate cgc gcc gcc aag etc ace aag gac cgc etc cgc age 1945 
Leu Thr Arg lie Arg Ala Ala Lys Leu Thr Lys Asp Arg Leu Arg Ser 
180 185 190 195 

gac ate gtg ctg att gcc cgc ace gac gcc etc cag cag cac ggc tac 1993 
Asp lie Val Leu lie Ala Arg Thr Asp Ala Leu Gin Gin His Gly Tyr 
200 205 210 

gac gag tgc att cgc cgc ctt aag gcc gcc cgc gat ctt ggc gcc gat 2041 
Asp Glu Cys lie Arg Arg Leu Lys Ala Ala Arg Asp Leu Gly Ala Asp 
215 220 225 

gtt ggt etc etc gag ggc ttc ace agt aag gag atg gcg agg egg tgt 2089 
Val Gly Leu Leu Glu Gly Phe Thr Ser Lys Glu Met Ala Arg Arg Cys 
230 235 240 

gtc cag gac ctt gcg cct tgg ccg ctt ctg etc aac atg gtg gag aac 2137 
Val Gin Asp Leu Ala Pro Trp Pro Leu Leu Leu Asn Met Val Glu Asn 
245 250 255 ' 

ggt get ggg ccg gtt att tec gtc gat gag get agg gaa atg ggc ttc 2185 
Gly Ala Gly Pro Val lie Ser Val Asp Glu Ala Arg Glu Met Gly Phe 
260 265 270 275 

cgc att atg ate ttc teg ttc get tgc att act cct gcc tat atg ggg 2233 
Arg lie Met lie Phe Ser Phe Ala Cys lie Thr Pro Ala Tyr Met Gly 
280 285 290 

att ace get get ctg gag agg etc aag aag gat ggt gtg gtt ggg ttg 2281 
lie Thr Ala Ala Leu Glu Arg Leu Lys Lys Asp Gly Val Val Gly Leu 
295 300 305 

ccc gag ggg atg ggg ccg aag aag ctg ttt gag gtg tgc gga ttg atg 232 9 
Pro Glu Gly Met Gly Pro Lys Lys Leu Phe Glu Val Cys Gly Leu Met 
310 315 320 

gac teg gtg agg gtt gat acc gag get ggt gga gat ggg ttt get aat 2377 
Asp Ser Val Arg Val Asp Thr Glu Ala Gly Gly Asp Gly Phe Ala Asn 
325 330 335 



WO 00/50576 



PCT/DKOO/00067 



ggt gtt taattctttt ctttttttga ttcttaattc cctggttgtt ttgttgtgaa 2433 

Gly Val 

340 

agtttcttat ttttctggtt tgttttattt ccccttctgg taactaattt tgtgtgagaa 2493 
agagttgttg agttgggttg aactgcattg gatgggattg atttattttc gggatcaaag 2553 
tgaaaggaag ggaagggggc tgtgttattg gttttcgagt ggggaccgat atattcctac 2613 
tatacatatc gaagcttgcg tggtacatat actagtatct actacattac caagaatgga 2673 
aatgaaaact gggtgttaga tttcagttga caggtcttat gttcgtttac cgatagagta 2733 
attcctgctt ctcactccat gtgagcccaa tcacaatgga attgtaatct ggttgcctta 2793 
taagtactta gtactctgta ctctgtacta cttctcgcat cacatcaaat cttaatactt 2853 
agtacgtagt ttgtttcacc cagcaaaacc ttattgcctt aacaatcata ttctcagtaa 2913 
gcacgagaca cagaaacgag agaagtattc tagaccctga cagaacwccc tgatcgacag 2 973 
tcacttaccc aacaaagtaa gtggtctcta ccctctgatt acagttaagg caggcagtag 3 033 
taagcaagaa gaagaaagaa agaataatta actactaagt ttctcactac tgcatgcacg 3 093 
accacggagt cgccgtgcaa aaaaattggt gcgtgctcag ctagctgcac tctgcacact 3153 
gccaccctcg ccctacaaaa gaaaccatgc tgtttctcca ctatactgtt cccgcgatga 3213 
aactagggcc aataaccatg cagttactat tggtcccact ggggtgggtt gggtagcctt 3273 
atggtattaa aaggagtagg ggtctttgtc gatcgttttc tgttttcttt ttgkattttt 3333 
atttytgttg gwctctgttt gtgttgtgtt gggccgnttt tgttttcttt gggtaacgag 3393 
ggatgggaat atattcatat ggaaatggaa atggattatg ctattgattg atgaatggtg 3453 
atgatctgcg tggaaattaa tgtcagagtc ttgmtgattc a 3494 

<210> 2 
<211> 341 
<212> PRT 

<213> Aspergillus niger 
<400> 2 

Met Lys Val Asp Thr Pro Asp Ser Ala Ser Thr He Ser Met Thr Asn 
1 5 io 



15 



Thr He Thr He Thr Val Glu Gin Asp Gly He Tyr Glu He Asn Gly 

20 25 ' 30 

Ala Arg Gin Glu Pro Val Val Asn Leu Asn Met Val Thr Gly Ala Ser 

35 40 45 
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Lys Leu Arg Lys Gin Leu Arg Glu Thr Asn Glu Leu Leu Val Cys Pro 
50 55 60 

Gly Val Tyr Asp Gly Leu Ser Ala Arg lie Ala lie Asn Leu Gly Phe 
65 70 75 80 

Lys Gly Met Tyr Met Thr Gly Ala Gly Thr Thr Ala Ser Arg Leu Gly 
85 90 95 

Met Ala Asp Leu Gly Leu Ala His lie Tyr Asp Met Lys Thr Asn Ala 
100 105 110 

Glu Met lie Ala Asn Leu Asp Pro Tyr Gly Pro Pro Leu lie Ala Asp 
115 120 125 

Met Asp Thr Gly Tyr Gly Gly Pro Leu Met Val Ala Arg Ser Val Gin 
130 35 140 

Gin Tyr lie Gin Ala Gly Val Ala Gly Phe His lie Glu Asp Gin lie 
145 150 155 160 

Gin Asn Lys Arg Cys Gly His Leu Ala Gly Lys Arg Val Val Thr Met 
165 170 175 

Asp Glu Tyr Leu Thr Arg lie Arg Ala Ala Lys Leu Thr Lys Asp Arg 
180 185 190 

Leu Arg Ser Asp lie Val Leu lie Ala Arg Thr Asp Ala Leu Gin Gin 
195 200 205 

His Gly Tyr Asp Glu Cys lie Arg Arg Leu Lys Ala Ala Arg Asp Leu 
210 215 220 

Gly Ala Asp Val Gly Leu Leu Glu Gly Phe Thr Ser Lys Glu Met Ala 
225 230 235 240 

Arg Arg Cys Val Gin Asp Leu Ala Pro Trp Pro Leu Leu Leu Asn Met 
245 250 255 

Val Glu Asn Gly Ala Gly Pro Val lie Ser Val Asp Glu Ala Arg Glu 
260 265 270 

Met Gly Phe Arg lie Met lie Phe Ser Phe Ala Cys lie Thr Pro Ala 
275 280 285 

Tyr Met Gly lie Thr Ala Ala Leu Glu Arg Leu Lys Lys Asp Gly Val 
290 295 300 

Val Gly Leu Pro Glu Gly Met Gly Pro Lys Lys Leu Phe Glu Val Cys 
305 310 315 320 

Gly Leu : Met Asp Ser Val Arg Val Asp Thr Glu Ala Gly Gly Asp Gly 
325 330 335 



Phe Ala Asn Gly Val 
340 
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<210> 3 
<211> 16 
<212> PRT 

<213> Aspergillus niger 
<220> 



<223> Description of the Sequence: Partial protein sequence 
<400> 3 

Met Lys Val Asp Thr Pro Asp Ser Ala Ser Thr lie Ser Met Thr Asn 
15 io -- 



15 



<210> 4 
<211> 15 
<212> PRT 

<213> Aspergillus niger 
<220> 

<223> Description of the Sequence: Partial protein sequence 
<400> 4 

Thr Asn Thr He Thr He Thr Val Glu Gin Asp Gly He Tyr Glu 
1 5 10 15 



<210> 5 
<211> 10 
<212> PRT 

<213> Aspergillus niger 
<220> 

<223> Description of the sequence: Partial protein sequence 
<400> 5 

Val Glu Gin Asp Gly He Tyr Glu He Asn 
1 5 10 



<210> 6 
<211> 15 
<212> PRT 

<213> Aspergillus niger 
<220> 

<223> Description of the Sequence: Partial protein sequence 
<400> 6 

Gly Ala Arg Gin Glu Pro Val Val Asn Leu Asn Met Val Thr Gly 
1 5 io -1 



15 
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<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxalac EST sense 
<400> 7 

aaagttgata cccccgattc t 21 



<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxalac EST antisense 
<400> 8 

atggcaatac ggggacagac c 21 



<210> 9 
<211> 17 
<212> DNA 

<213> Artificial Sequence . . 

<220> 

<223> Description of Artificial Sequence: M13 reverse primer 
<400> 9 

caggaaacag ctatgac 17 

<210> 10 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > Description of Artificial Sequence: M13 Forward (-20) primer 
<400> 10 

gtaaaacgac ggccag 16 

<210> 11 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OXEST1 
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<400> 11 

gacggtctgt cgccgtattg c 

<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OXEST2 
<400> 12 

ggaagcagaa tcgggggtat c 

<210> 13 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Flank 1 
<400> 13 

gcggccgcgc gccaataacg tccgattc 

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Flank 2 
<400> 14 

gcggccgcct acaaatacat tgacctccc 

<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oxalac EST sense 
<400> 15 

aaagttgata cccccgattc t 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Oxalac EST antisense 



<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OXEST 1 

<400> 17 

gacggtctgt cgccgtattg c 21 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OXEST 2 



<210> 19 

<211> 21 

<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence: OXEST 3 

<400> 19 

gccggagtcg cgggattcca c 21 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OXEST 4 



<400> 16 

atggcaatac ggcgacagac c 



21 



<400> 18 

ggaagcagaa tcgggggtat c 



21 



<400> 20 

ggcggactat gatttgtgcc 



20 



<210> 
<211> 



21 
20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX5 
<400> 21 

tgatggtcgc ccgttccgtt 

<210> 22 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX6 
<400> 22 

tgccattcaa ttttcttggc c 

<210> 23 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX7 

<400> 23 

tgatcttcga tgtggaatcc c 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX8 
<400> 24 

gatggcgtcg attgaccatt t 

<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX9 
<400> 25 

ggagatgggt ttgctaatgg tgtt 
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<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX10 
<400> 26 

ttagcaaacc catctccacc 20 

<210> 27 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX11 



<210> 28 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX12 
<400> 28 

cgagagaagt attctagacc c 21 

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX13 



<400> 27 

cgaattactg gtcattagcc c 



21 



<400> 29 

tgactgtcga tcagggtgtt 



2 0 



<210> 30 
<211> 20 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: OX14 



WO 00/50576 



PCT/DKOO/00067 



12 

<400> 30 

gtgtgcggat tgatggactc 

<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: OX15 
<400> 31 

caacccaact caacaactct 

<210> 32 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLANKE1 

<400> 32 

gcggccgcgc gccaataacg tccgattc 

<210> 33 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLANKE2 



<400> 33 

gcggccgcct acaaatacat tgacctccc 
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